1. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application U.S. Patent Application Serial No. 09/918,585 recorded 
October 24, 2001, at Reel 012095 and Frame 0677. 

2. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to antibodies to a polypeptide 
referred to herein as "PR0274." There exists one related pending patent application, U.S. Patent 
Application Serial No. 09/978,188, filed October 15, 2001 (containing claims directed to 
PR0274 polypeptides). This related application is also under final rejection by the same 
Examiner for very similar reasons, and an appeal of those final rejections is being 
pursued independently and concurrently herewith. Although several applications 
directed to the "gene amplification" utility in general are under appeal, none of them relates to 
PR0274 molecules or antibodies that bind to them. 

3. STATUS OF CLAIMS 

Claims 58-62 are in this application. 
Claims 1-57 and 63 are canceled. 

Claims 58-62 stand rejected and Appellants appeal the rejection of these claims. 
A copy of the rejected claims in the present Appeal is provided as Appendix A. 

4. STATUS OF AMENDMENTS 

All prior amendments have been entered by the Examiner. No claim amendments have 
been submitted after the Response filed August 3, 2006. 

5. SUMMARY OF THE INVENTION 

The invention claimed in the present application concerns an isolated antibody that 
specifically binds to the polypeptide of SEQ ID NO:7 (Claim 58). The invention further provides 
monoclonal antibodies (Claim 59), humanized antibodies (Claim 60), antibody fragments (Claim 
61), and labeled antibodies (Claim 62) that specifically bind to the polypeptide of SEQ ID NO:7. 

Support for the preparation and uses of antibodies is found throughout the specification, 
including, for example, pages 217-225. The preparation of antibodies is described in 
Example 104, while Example 106 describes the use of the antibodies for purifying the 
polypeptides to which they bind. Isolated antibodies are defined in the specification at page 132, 
lines 29-38. Support for monoclonal antibodies is found in the specification at, for example, 
page 217, line 30, to page 219, line 11, and Example 104. Support for humanized antibodies is 
found in the specification at, for example, page 219, line 12, to page 220, line 14. Support for 
antibody fragments is found in the specification at, for example, page 131, line 29, to page 132, 
line 22, and page 221, lines 6-34. Support for labeled antibodies is found in the specification at, 
for example, page 133, lines 1-4, and page 224, line 35, to page 225, line 4. 

The polypeptide of SEQ ID NO:7 is designated PR0274, and its amino acid sequence is 
shown in Figure 4, while the encoding nucleic acid sequence (SEQ ID NO:6) is shown in Figure 
3. The specification discloses that various portions of the PR0274 polypeptide possess 
significant sequence similarity to the seven transmembrane receptor proteins (see, for example, 
page 2, line 27 to page 3, line 6). The isolation of cDNA clones encoding PR0274 of SEQ ID 
NO:7 is described in Example 4. Examples 100-103 describe the expression of PRO 
polypeptides in various host cells, including E. coli, mammalian cells, yeast and 
Baculovirus-infected insect cells. Finally, Example 114, in the specification at page 331, line 23, 
to page 346, line 4, sets forth a Gene Amplification assay which shows that the PR0274 gene is 
amplified in the genome of certain human lung and colon cancers (see Table 9). 

The specification discloses that antibodies to PRO polypeptides may be used, for 
example, in purification of PRO (page 225, lines 5-11 and Example 106), in diagnostic assays for 
PRO expression (page 190, lines 3-9, and page 224, line 21 to page 225, line 4), as antagonists to 
PRO (page 198, lines 3-6), and as elements of pharmaceutical compositions for the treatment of 
various disorders (page 223, line 30, to page 224, line 28). 

6. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

I. Whether Claims 58-62 satisfy the utility requirement of 35 U.S.C. §101. 

II. Whether Claims 58-62 satisfy the enablement requirement of 35 U.S.C. §112, first 
paragraph. 

III. Whether Claims 58-62 are patentable under 35 U.S.C. § 102(b) over Ho et al., 
Science, Vol. 289, pp 265-270. 

IV. Whether Claims 59-62 are patentable under 35 U.S.C. § 103(a) over Ho et al., in 
view of Janeway et al. 

7. ARGUMENT 

A. Summary of the Arguments: 
Issue I: Utility 

Appellants rely upon the gene amplification data of the PR0274 gene for patentable 
utility of the PR0274 polypeptide and the antibodies that bind it. This data is clearly disclosed in 
the instant specification in Example 114, which discloses that the gene encoding PR0274 
showed significant amplification in primary lung tumors. Appellants submit that one of skill in 
the art would reasonably expect in this instance, based on the amplification data for the PR0274 
gene, that the PR0274 polypeptide is concomitantly overexpressed and has utility in the 
diagnosis of lung cancer or for individuals at risk for developing lung cancer. 

The Examiner has asserted that it does not necessarily follow that an increase in gene 
copy number results in increased gene expression and increased protein expression, such that 
antibodies would be useful diagnostically. In support of these assertions, the Examiner referred 
to articles by Pennica et al. and Li et al. as evidence showing a lack of correlation between gene 
(DNA) amplification and mRNA levels, as well as articles by Gygi et al., Hu et al., Nagaraja et 
al., Waghray et al., and Sagynaliev et al. as providing evidence that polypeptide levels cannot be 
accurately predicted from mRNA levels. 

Appellants submit that the teachings of the Examiner's cited references do not 
conclusively establish a prima facie case for lack of utility (as will be discussed in detail below). 
As further support for their utility claim, Appellants have submitted a Declaration by Dr. Audrey 
Goddard, which explains that a gene identified as being amplified at least 2-fold by the disclosed 
gene amplification assay in a tumor sample relative to a normal sample is useful as a marker for 
the diagnosis of cancer, and for monitoring cancer development and/or for measuring the efficacy 
of cancer therapy. Therefore, such a gene is useful as a marker for the diagnosis of lung cancer, 
and for monitoring cancer development and/or for measuring the efficacy of cancer therapy. 
According to the Goddard Declaration, the 2-fold to 3.1-fold amplification of PR0274 in three 
primary lung tumors would be considered significant and credible by one skilled in the art, based 
upon the facts disclosed therein. The Examiner has not provided any evidence to show that the 
disclosed DNA amplification is not significant. 

Appellants have also submitted ample evidence to show that, in general, if a gene is 
amplified in cancer, it is more likely than not that the encoded protein will be expressed at an 
elevated level. For instance, the articles by Orntoft et al., Hyman et al., and Pollack et al. 
collectively teach that, in general, gene amplification increases mRNA expression. Of note, 
Appellants point out that the Examiner conceded this point in the preceding Advisory 
Action. 

Further, Appellants have submitted over a hundred references, along with Declarations of 
Dr. Paul Polakis, which collectively teach that, in general, there is a correlation between mRNA 
levels and polypeptide levels. Appellants would also like to bring to the Examiner's attention a 
recent decision in a microarray case by the Board of Patent Appeals and Interferences (Decision 
on Appeal No. 2006-1469). In its decision, the Board reversed the utility rejection, 
acknowledging that "there is a strong correlation between mRNA levels and protein expression, 
and the Examiner has not presented any evidence specific to the PR01866 polypeptide to refute 
that." (Page 9). Appellants submit that, in the instant application, the Examiner has likewise not 
presented any evidence specific to the PR0274 polypeptide to refute Appellants' assertion of a 
correlation between DNA levels, mRNA levels and protein expression. 

Taken together, although there are some examples in the scientific art that do not fit 
within the central dogma of molecular biology that there is generally a positive correlation 
between DNA, mRNA, and polypeptide levels, for the majority of amplified genes, as 
exemplified by the teachings of Orntoft et al., Hyman et al., Pollack et al., and the two Polakis 
Declarations, the art overwhelmingly shows that gene amplification influences gene 
expression at the mRNA and protein levels. Therefore, one of skill in the art would reasonably 
expect in this instance, based on the amplification data for the PR0274 gene, that the PR0274 
polypeptide is concomitantly overexpressed and has utility in the diagnosis of lung cancer. 

Appellants further submit that even if there were no correlation between gene 
amplification and increased mRNA/protein expression (which Appellants expressly do not 
concede), a polypeptide encoded by a gene that is amplified in cancer would still have a 
specific, substantial, and credible utility. Appellants submit that, as evidenced by the Ashkenazi 
Declaration and the teachings of Hanna et al., simultaneous testing of gene amplification and 
gene product over-expression enables more accurate tumor classification, even if the gene 
product, the protein, is not over-expressed. This leads to better determination of a suitable 
therapy for the tumor, as demonstrated by a real-world example of the breast cancer marker 
HER-2/neu. 

Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed antibodies to PR0274 polypeptides. 

Issue II: Enablement 

Appellants respectfully submit that the data presented in Example 114 of the specification 
and the cumulative evidence of record support a "specific, substantial and credible" asserted 
utility for the presently claimed invention. Accordingly, one of ordinary skill in the art would 
understand how to make and use the recited antibodies for the diagnosis of lung cancer without 
any undue experimentation. 

These arguments are all discussed in further detail below under the appropriate headings. 

Issue III: Anticipation by Ho et al. 

Claims 58-62 stand rejected under 35 U.S.C. § 102(b) as being anticipated by Ho et al., 
Science, Vol. 289, pp 265-270, published July 14, 2000. 

The instant application claims priority to International Application No. 
PCT/US00/03565, which first disclosed the gene amplification results and was filed 
February 11, 2000, over five months before the publication date of Ho et al. The instant 
application has not been granted the earlier priority date on the grounds that "the gene 
amplification assay fails to disclose a patentable utility for the antibodies to the protein." (Page 
10 of the Office Action mailed July 19, 2005). Appellants respectfully submit that, as discussed 
above under Issues I and II, the presently claimed invention is supported by a specific, substantial 
and credible utility and, therefore, the present specification teaches one of ordinary skill in the art 
"how to use" the claimed invention without undue experimentation. Accordingly, the instant 
application is entitled to the effective filing date of February 11, 2000, and thus Ho et al. is not 
prior art. 

Issue IV: Obviousness over Ho et al. in view of Janeway et al. 

Claims 59-62 stand rejected under 35 U.S.C. § 103(a) as being unpatentable over Ho et al. 
in view of Immunology, The Immune System in Health and Disease, Third Edition, Janeway and 
Travers, Ed., 1997. 

As discussed above, the instant application is entitled to priority to International 
Application No. PCT/US00/03565, and to the effective filing date of February 11, 2000. Thus 
Ho et al. is not prior art. 

These arguments are all discussed in further detail below under the appropriate headings. 

B. Detailed Arguments 

Issue I: Claims 58-62 satisfy the utility requirement of 35 U.S.C. §101 

The sole basis for the Examiner's rejection of Claims 58-62 under these sections is that 
the data presented in Example 114 of the present specification is allegedly insufficient under 
applicable legal standards to establish a patentable utility under 35 U.S.C. §101 for the presently 
claimed subject matter. 

Appellants strongly disagree and respectfully traverse the rejection. 

i) The Legal Standard For Utility Under 35 U.S.C. §101 

According to 35 U.S.C. §101: 

Whoever invents or discovers any new and useful process, machine, manufacture, 
or composition of matter, or any new and useful improvement thereof, may obtain 
a patent therefor, subject to the conditions and requirements of this title. 
(Emphasis added). 

In interpreting the utility requirement, in Brenner v. Manson, 1 the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent Appellant disclose a "substantial utility" for his or 
her invention, i.e., a utility "where specific benefit exists in currently available form." 2 The 
Court concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 
commerce rather than the realm of philosophy." 3 

Later, in Nelson v. Bowler, 4 the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may not 
establish a specific therapeutic use. The Court held that "since it is crucial to provide researchers 
with an incentive to disclose pharmaceutical activities in as many compounds as possible, we 
conclude adequate proof of any such activity constitutes a showing of practical utility." 5 

In Cross v. Iizuka, 6 the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 
i.e., there is a reasonable correlation therebetween." 7 The Court perceived "[n]o insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 
practical utility." 8 



1 Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id. at 856, 206 U.S.P.Q. (BNA) at 883. 

6 Cross v. Iizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 

8 Id. 

The case law has also clearly established that Appellants' statements of utility are usually 
sufficient, unless such statement of utility is unbelievable on its face. 9 The PTO has the initial 
burden to prove that Appellants' claims of usefulness are not believable on their face. 10 In 
general, an Appellant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in the 
art to question the objective truth of the statement of utility or its scope." 11, 12 

Compliance with 35 U.S.C. §101 is a question of fact. 13 The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 
totality of the evidence under consideration. 14 Thus, to overcome the presumption of truth that 
an assertion of utility by the Appellant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner has made a proper prima facie showing of lack of utility does the burden of 
rebuttal shift to the Appellant. The issue will then be decided on the totality of evidence. 

The well-established case law is clearly reflected in the Utility Examination Guidelines 
("Utility Guidelines"), 15 which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when it 
is particular to the subject matter claimed. For example, it is generally not enough to state that a 
nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 Ibid. 

11 In re Langer, 503 F.2d 1380, 1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In re Jolles, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 
F.2d 974, 144 USPQ 351 (1965); In re Sichert, 566 F.2d 1154, 1159, 196 USPQ 209, 212-13 
(C.C.P.A. 1977). 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983), 
cert. denied, 469 US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 

15 66 Fed. Reg. 1092 (2001). 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the public" 
or similar formulations used in certain court decisions to mean that products or services based on 
the claimed invention must be "currently available" to the public in order to satisfy the utility 
requirement. "Rather, any reasonable use that an Appellant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as sufficient, at least with regard 
to defining a 'substantial utility.'" 16 Indeed, the Guidelines for Examination of Applications for 
Compliance With the Utility Requirement, 17 gives the following instruction to patent examiners: 
"If the Appellant has asserted that the claimed invention is useful for any particular practical 
purpose . . . and the assertion would be considered credible by a person of ordinary skill in the 
art, do not impose a rejection based on lack of utility." 

ii) The Data and Documentary Evidence Supporting a Patentable Utility 

Patentable utility for the PR0274 polypeptides and their antibodies is based upon the 
gene amplification data for the gene encoding the PR0274 polypeptide of SEQ ID NO: 7. 
Appellants respectfully submit that the data presented in Example 114 of the specification and 
the cumulative evidence of record support a "specific, substantial and credible" asserted utility 
for the presently claimed invention. 

Example 114 describes the results obtained using a very well-known and routinely 
employed polymerase chain reaction (PCR)-based assay, the TaqMan™ PCR assay, also referred 
to herein as the gene amplification assay. This assay allows one to quantitatively measure the 
level of gene amplification in a given sample, say, a tumor extract, or a cell line. It was well 
known in the art at the time the invention was made that gene amplification is an essential 
mechanism for oncogene activation. Appellants isolated genomic DNA from a variety of 
primary cancers and cancer cell lines that are listed in Table 9, including primary lung cancers of 
the type and stage indicated in Table 8 of the specification. The tumor samples were tested in 
triplicate with TaqMan primers and with internal controls, beta-actin and GAPDH, in order to 
quantitatively compare DNA levels between samples. As a negative control, DNA was isolated 
from the cells of ten normal healthy individuals and pooled; no-template controls were also 
included. Gene amplification was monitored using real-time quantitative 
TaqMan™ PCR. Table 9 shows the resulting gene amplification data. Further, Example 114 
explains that the results of TaqMan™ PCR are reported in ΔCt units, wherein "one unit 
corresponds to one PCR cycle or approximately a 2-fold amplification, relative to control, two 
units correspond to 4-fold, 3 units to 8-fold amplification and so on" (emphasis added). 
Appellants respectfully submit that a ΔCt value of at least 1.0, which corresponds to at least a 
2-fold increase, was observed for PR0274 in primary lung tumors LT4, LT16, and LT18. PR0274 
showed approximately 1.00-1.61 ΔCt units, which corresponds to 2^1.00 to 2^1.61 fold, or 2.0-3.1 fold, 
amplification in three different human primary lung tumors, which is significant, and thus the 
PR0274 gene has utility as a diagnostic marker of lung cancer. 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 II(B)(1). 
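
By way of illustration only, the conversion from ΔCt units to fold amplification described in Example 114 (one unit corresponding to approximately a 2-fold amplification) can be checked with a short calculation. The sketch below is not part of the specification; the function name fold_amplification is a hypothetical label chosen for this illustration, and the relationship fold = 2^ΔCt is the approximation stated above.

```python
def fold_amplification(delta_ct: float) -> float:
    """Approximate fold amplification for a given delta-Ct value.

    Assumes each delta-Ct unit (one PCR cycle) corresponds to roughly
    a 2-fold difference relative to the control, i.e. fold = 2 ** delta_ct.
    """
    return 2.0 ** delta_ct


if __name__ == "__main__":
    # 1.00 and 1.61 are the ends of the range reported for PR0274;
    # 2.0 and 3.0 are the illustrative values quoted from Example 114.
    for delta_ct in (1.00, 1.61, 2.0, 3.0):
        fold = fold_amplification(delta_ct)
        print(f"delta-Ct {delta_ct:.2f} -> approximately {fold:.1f}-fold")
```

Under this approximation, the reported range of 1.00 to 1.61 ΔCt units corresponds to roughly 2.0-fold to 3.1-fold amplification, consistent with the figures relied upon above.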

It is also well known that gene amplification occurs in most solid tumors, and generally is 
associated with poor prognosis. 

In support, Appellants have submitted, in their Response filed September 14, 2004, a 

Declaration by Dr. Audrey Goddard. Appellants particularly draw the Board's attention to page 3 

of the Goddard Declaration which clearly states that: 

It is further my considered scientific opinion that an at least 2-fold increase in 
gene copy number in a tumor tissue sample relative to a normal (i.e., non-tumor) 
sample is significant and useful in that the detected increase in gene copy 
number in the tumor sample relative to the normal sample serves as a basis for 
using relative gene copy number as quantitated by the TaqMan PCR technique 
as a diagnostic marker for the presence or absence of tumor in a tissue sample of 
unknown pathology. Accordingly, a gene identified as being amplified at least 
2-fold by the quantitative TaqMan PCR assay in a tumor sample relative to a 
normal sample is useful as a marker for the diagnosis of cancer, for 
monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. (Emphasis added). 

As indicated above, the gene encoding the PR0274 polypeptide shows significantly 
higher than a two fold amplification in three different lung tumors. In addition, the Goddard 
Declaration clearly establishes that the TaqMan real-time PCR method described in Example 114 
has gained wide recognition for its versatility, sensitivity and accuracy, and is in extensive use for 
the study of gene amplification. The facts disclosed in the Declaration also confirm that based 
upon the gene amplification results, one of ordinary skill would find it credible that PR0274 is a 
diagnostic marker of lung cancer. 

Further, as discussed in detail below, Appellants have provided ample evidence in the 
form of articles from the art, like Orntoft et al., Hyman et al., Pollack et al., and over 100 
references (see Evidence List items 13-144) and Declarations by experts in the field of oncology 
and gene expression, i.e., the Declarations by Dr. Paul Polakis (I and II) and by Dr. Avi 
Ashkenazi, to show that, in general, if a gene is amplified in cancer, it is "more likely than not" 
that the encoded protein will also be expressed at an elevated level. 

The Examiner has asserted that "the specification provides data showing a very small 
increase in DNA copy number, approximately 2-fold, in a few tumor samples for PR0274." 
(Page 4 of the Office Action mailed July 19, 2005). The Examiner further asserts that "it was 
imperative to find evidence in the relevant scientific literature whether or not a small increase in 
DNA copy number would be considered by the skilled artisan to be predictive of increased 
mRNA and polypeptide levels." (Page 4 of the Office Action mailed July 19, 2005). 

Appellants respectfully submit that the Examiner seems to be applying a heightened 
utility standard in this instance, which is legally incorrect. Appellants have shown that the gene 
encoding PR0274 demonstrated significant amplification, from 2.0-3.1 fold, in three lung 
tumors. As explained in the Declaration of Dr. Audrey Goddard (submitted with the Response 
filed September 14, 2004): 

It is further my considered scientific opinion that an at least 2-fold increase in 
gene copy number in a tumor tissue sample relative to a normal (i.e., non-tumor) 
sample is significant and useful in that the detected increase in gene copy 
number in the tumor sample relative to the normal sample serves as a basis for 
using relative gene copy number as quantitated by the TaqMan PCR technique 
as a diagnostic marker for the presence or absence of tumor in a tissue sample of 
unknown pathology. (Emphasis added). 

By referring to the 2.0-fold to 3.1-fold amplification of the PR0274 gene in lung tumors 
as "very small," the Examiner appears to disregard the teachings of an expert's declaration 
without any basis and without presenting any evidence to the contrary. Appellants respectfully 
draw the Examiner's attention to the Utility Examination Guidelines (Part IIB, 66 Fed. Reg. 1098 
(2001)) which state that: 

"Office personnel must accept an opinion from a qualified expert 
that is based upon relevant facts whose accuracy is not being 
questioned; it is improper to disregard the opinion solely because 
of a disagreement over the significance or meaning of the facts 
offered". 

Thus, barring evidence to the contrary, Appellants maintain that the 2.0-fold to 3.1-fold 
amplification disclosed for the PR0274 gene is significant and forms the basis for the utility 
claimed herein. 

The Examiner has further asserted that "[g]iven that PR0274 was amplified in only a 
very small number of tumors of the same type, the data do not support the implicit conclusion of 
the specification that PR0274 shows a positive correlation with lung cancer, much less that the 
levels of PR0274 would be diagnostic of such." (Page 6 of the Office Action mailed May 20, 
2004). 

Appellants emphasize that they have shown significant DNA amplification in three of 
the lung tumor samples in Table 9, Example 114 of the instant specification. The fact that not all 
lung tumors tested positive in this study does not make the gene amplification data less 
significant. As any skilled artisan in the field of oncology would easily appreciate, not all tumor 
markers are generally associated with every tumor, or even, with most tumors. For example, the 
article by Hanna and Mornin (submitted with the Response filed September 14, 2004), discloses 
that the known breast cancer marker HER-2/neu is "amplified and/or overexpressed in 10%-30% 
of invasive breast cancers and in 40%-60% of intraductal breast carcinoma" (page 1, col. 1). In 
fact, some tumor markers are useful for identifying rare malignancies. That is, the association of 
the tumor marker with a particular type of tumor lesion may be rare, or the occurrence of that 
particular kind of tumor lesion itself may be rare. In either event, even these rare tumor markers, 
which do not give a positive hit for most common tumors, have great value in tumor diagnosis 
and, consequently, in tumor prognosis. The skilled artisan would certainly know that such tumor 
markers are useful for better classification of tumors. Therefore, whether the PR0274 gene is 
amplified in three lung tumors or in all lung tumors is not relevant to its identification as a tumor 
marker, or its patentable utility. Rather, the fact that the amplification data for PR0274 is 
considered significant is what lends support to its usefulness as a tumor marker. 

The Examiner has asserted that "[t]he data presented in the specification were not 
corrected for aneuploidy" and cites a reference by Sen et al. in support of the assertion that "[a] 
slight amplification of a gene does not necessarily mean overexpression in a cancer tissue, but 
can merely be an indication that the cancer tissue is aneuploid." (Page 6 of the Office Action 
mailed May 20, 2005). 

Appellants submit that it is known in the art that detection of gene amplification can be 
used for cancer diagnosis regardless of whether the increase in gene copy number results from 
intrachromosomal changes or from chromosomal aneuploidy. As explained by Dr. Ashkenazi in 
his Declaration (submitted with Appellants' Response filed September 14, 2004), 

An increase in gene copy number can result not only from intrachromosomal 
changes but also from chromosomal aneuploidy. It is important to understand that 
detection of gene amplification can be used for cancer diagnosis even if the 
determination includes measurement of chromosomal aneuploidy. Indeed, as long 
as a significant difference relative to normal tissue is detected, it is irrelevant if 
the signal originates from an increase in the number of gene copies per 
chromosome and/or an abnormal number of chromosomes. 

Hence, Appellants submit that amplification of a gene, whether by aneuploidy or any other 
mechanism, is useful as a diagnostic marker. 

The Examiner has asserted that "[o]ne skilled in the art would do further research to 
determine whether or not the PR0274 polypeptide levels increased significantly in the tumor 
samples. The requirement for such further research makes it clear that the asserted utility is not 
yet in currently available form, i.e., it is not substantial." (Page 4 of the Office Action mailed 
July 19, 2005). 

As discussed above, M.P.E.P. §2107.01 cautions Office personnel not to interpret the 
phrase "immediate benefit to the public" or similar formulations used in certain court decisions 
to mean that products or services based on the claimed invention must be "currently available" to 
the public in order to satisfy the utility requirement. "Rather, any reasonable use that an 
Appellant has identified for the invention that can be viewed as providing a public benefit should 
be accepted as sufficient, at least with regard to defining a 'substantial utility.'" 18 Indeed, the 
Guidelines for Examination of Applications for Compliance With the Utility Requirement, 19 
gives the following instruction to patent examiners: "If the Appellant has asserted that the 
claimed invention is useful for any particular practical purpose . . . and the assertion would be 
considered credible by a person of ordinary skill in the art, do not impose a rejection based on 
lack of utility." 

Appellants' position is based on the overwhelming evidence from gene amplification data 
disclosed in the specification which clearly indicate that the gene encoding PR0274 is 
significantly amplified in certain lung tumors. Based on the working hypothesis among those 
skilled in the art that if a gene is amplified in cancer, the encoded protein is likely to be expressed 
at an elevated level, one skilled in the art would simply accept that, since the PR0274 gene is 
amplified, the PR0274 polypeptide would more likely than not be over-expressed. Thus, data 
relating to PR0274 polypeptide expression may be used for the same diagnostic and prognostic 
purposes as data relating to PR0274 gene expression. Therefore, based on the disclosure in the 
specification, no further research would be necessary to determine how to use the claimed 
antibodies that bind to the PR0274 polypeptide, because the current invention is fully enabled by 
the disclosure of the present application. 

Accordingly, Appellants submit that based on the general knowledge in the art at the time 
the invention was made and the teachings in the specification, the specification provides clear 
guidance as to how to interpret and use the data relating to PR0274 polypeptide expression and 
that the claimed antibodies which bind the PR0274 polypeptide have utility in the diagnosis of 
cancer. 

iii) A prima facie case of lack of utility has not been established 

Appellants submit that the evidentiary standard to be used throughout ex parte 
examination of a patent application is a preponderance of the totality of the evidence under 
consideration. Thus, to overcome the presumption of truth that an assertion of utility by the 
Appellant enjoys, the Examiner must establish that it is more likely than not that one of ordinary 
skill in the art would doubt the truth of the statement of utility. Only after the Examiner has
made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the
Appellant.

19 M.P.E.P. §2107 II(B)(1).

The Examiner has asserted that "it does not necessarily follow that an increase in gene 
copy number results in increased gene expression and increased protein expression, such that 
antibodies would be useful diagnostically." (Page 7 of the Office Action mailed May 20, 2004). 
In support of these assertions, the Examiner referred to Pennica et al and contended that 
"Pennica et al. was cited as evidence showing a lack of correlation between gene (DNA) 
amplification and mRNA levels." (Page 4 of the Office Action mailed July 19, 2005). The 
Examiner further referred to Gygi et al, and asserted that "Gygi et al. was cited as providing 
evidence that polypeptide levels cannot be accurately predicted from mRNA levels." (Page 4 of 
the Office Action mailed July 19, 2005).

As a preliminary matter, Appellants respectfully submit that it is not a legal requirement 
to establish that gene amplification "necessarily" results in increased expression at the mRNA 
and polypeptide levels, or that protein levels can be "accurately predicted." As discussed above, 
the evidentiary standard to be used throughout ex parte examination of a patent application is a 
preponderance of the totality of the evidence under consideration. Accordingly, Appellants 
submit that in order to overcome the presumption of truth that an assertion of utility by the 
Appellant enjoys, the Examiner must establish that it is more likely than not that one of 
ordinary skill in the art would doubt the truth of the statement of utility. Therefore, it is not 
legally required that there be a "necessary" correlation between the data presented and the 
claimed subject matter. The law requires only that one skilled in the art should accept that such a 
correlation is more likely than not to exist. Appellants respectfully submit that when the proper
evidentiary standard is applied, a correlation must be acknowledged. 

Pennica et al 

The Examiner cited the abstract of Pennica et al for its disclosure that "WISP-1 gene 
amplification and overexpression in human colon tumors showed a correlation between DNA 
amplification and over-expression, whereas overexpression of WISP-3 RNA was seen in the 
absence of DNA amplification. In contrast, WISP-2 DNA was amplified in colon tumors, but its 
mRNA expression was significantly reduced in the majority of tumors compared with expression 
in normal colonic mucosa from the same patient." From this, the Examiner correctly concluded
that increased copy number does not necessarily result in increased polypeptide expression. The 
standard, however, is not absolute certainty. 

In fact, as noted even in Pennica et al., "[a]n analysis of WISP-1 gene amplification and
expression in human colon tumors showed a correlation between DNA amplification and
over-expression..." (Pennica et al., page 14722, left column, first full paragraph, emphasis added).
Thus the findings of Pennica et al. with respect to WISP-1 support Appellants' arguments. In the
case of WISP-3, the authors report that there was no change in the DNA copy number, but there 
was a change in mRNA levels. This apparent lack of correlation between DNA and mRNA 
levels is not contrary to Appellants' assertion that a change in DNA copy number generally leads 
to a change in mRNA level. Appellants are not attempting to predict the DNA copy number 
based on changes in mRNA level, and Appellants have not asserted that the only means for 
changing the level of mRNA is to change the DNA copy number. Therefore a change in mRNA 
without a change in DNA copy number is not contrary to Appellants' assertions. 

The fact that the single WISP-2 gene did not show the expected correlation of gene 
amplification with the level of mRNA/protein expression does not establish that it is more likely 
than not, in general, that such correlation does not exist. The Examiner has not shown whether 
the lack of correlation observed for the WISP-2 gene is typical, or is merely a discrepancy, an
exception to the rule of correlation. Indeed, the working hypothesis among those skilled in the
art is that, if a gene is amplified in cancer, the encoded protein is likely to be expressed at an 
elevated level, as was demonstrated for WISP-1. 

Accordingly, Appellants respectfully submit that Pennica et al. teaches nothing
conclusive regarding the absence of correlation between amplification of a gene and
over-expression of the encoded WISP polypeptide. More importantly, the teaching of Pennica et al. is
specific to WISP genes. Pennica et al. has no teaching whatsoever about the correlation of gene
amplification and protein expression in general.

Gygi et al.

The Examiner further cited the Gygi et al. reference to establish that "the protein levels
cannot be accurately predicted from the level of the corresponding mRNA transcript." The
Examiner adds that "Gygi et al ... studied over 150 proteins. . . and found no strong correlation
between proteins and transcript levels." (Page 7 of the Office Action mailed May 20, 2004). 

Appellants respectfully traverse and point out that, on the contrary, Gygi et al never 
indicate that the correlation between mRNA and protein levels does not exist. Gygi et al only 
state that the correlation may not be sufficient in accurately predicting protein level from the 
level of the corresponding mRNA transcript (Emphasis added) (see page 1270, Abstract). This 
result is expected, since there are many factors that determine translation efficiency for a given 
transcript, or the half-life of the encoded protein. Not surprisingly, Gygi et al concluded that 
protein levels cannot always be accurately predicted from the level of the corresponding mRNA 
transcript in a single cellular stage or type when looking at the level of transcripts across different 
genes.

Importantly, Gygi et al did not say that for a single gene, a change in the level of mRNA 
transcript is not positively correlated with a change in the level of protein expression. Appellants 
have asserted that increasing the level of mRNA for a particular gene leads to a corresponding 
increase for the encoded protein. Gygi et al did not study this issue and say absolutely nothing
about it. One cannot look at the level of mRNA across several different genes to investigate 
whether a change in the level of mRNA for a particular gene leads to a change in the level of 
protein for that gene. Therefore, Gygi et al is not inconsistent with or contradictory to the utility 
of the instant claims, and offers no support for the PTO's rejection of Appellants' asserted utility. 

Furthermore, Appellants note that contrary to the Examiner's statement, the Gygi data 
indicate a general trend of correlation between protein [expression] and transcript levels 
(Emphasis added). For example, as shown in Figure 5, an mRNA abundance of 250-300
copies/cell correlates with a protein abundance of 500-1000 x 10³ copies/cell. An mRNA abundance of
100-200 copies/cell correlates with a protein abundance of 250-500 x 10³ copies/cell (emphasis
added). Therefore, high levels of mRNA generally correlate with high levels of proteins. In 
fact, most data points in Figure 5 did not deviate or scatter away from the general trend of 
correlation. Thus, the Gygi data meets the "more likely than not" standard and shows that a
positive correlation exists between mRNA and protein. Therefore, Appellants submit that the
Examiner's rejection is based on a misrepresentation of the scientific data presented in Gygi et al.

Gygi et al may teach that protein levels cannot be "accurately predicted" from mRNA
levels in the sense that the exact numerical amounts of protein present in a tissue cannot be 
determined based upon mRNA levels. Appellants respectfully submit that the Office Action's 
emphasis on the need to "accurately predict" protein levels based on mRNA levels misses the 
point. The asserted utility for the claimed polypeptides is in the diagnosis of cancer. What is 
relevant to use as a cancer diagnostic is relative levels of gene or protein expression, not absolute 
values, that is, that the gene or protein is differentially expressed in tumors as compared to 
normal tissues. Appellants need only show that there is a correlation between DNA, mRNA, and 
protein levels, such that gene amplification and mRNA overexpression generally predict protein 
overexpression. A showing that mRNA levels can be used to "accurately predict" the precise 
levels of protein expression is not required.

In conclusion, the Examiner has not shown that a lack of correlation between gene
amplification and polypeptide over-expression, as observed for the WISP-2 gene, is typical. In fact,
contrary to what the Examiner contends, the art indicates that, if a gene is amplified in cancer, it
is more likely than not that the encoded protein will be expressed at an elevated level. As noted
even in Pennica et al, a correlation between DNA amplification and polypeptide over-expression
was observed in the case of WISP-1 and, similarly, in Gygi et al, most genes showed a
correlation between mRNA levels and protein levels. Since the standard is not absolute 
certainty, a prima facie showing of lack of utility has not been made in this instance. 

Hu et al. 

The Examiner further cited Hu et al to the effect that genes displaying a 5-fold change or 
less in mRNA expression in tumors compared to normal showed no evidence of a correlation 
between altered gene expression and a known role in the disease. However, among genes with a 
10-fold or more change in expression level, there was a strong and significant correlation 
between expression level and a published role in the disease. (Page 7 of the Office Action mailed
July 19, 2005). 

Appellants first note that the title of Hu et al is "Analysis of Genomic and Proteomic 
Data Using Advanced Literature Mining." As the title clearly suggests, the conclusion suggested 
by Hu et al. is merely based on a statistical analysis of the information disclosed in the published 
literature. As Hu et al. states, "We have utilized a computational approach to literature mining to
produce a comprehensive set of gene-disease relationships." In particular, Hu et al. relied on the
MedGene Database and the Medical Subject Heading (MeSH) files to analyze the gene-disease
relationship. More specifically, Hu et al. "compared the MedGene breast cancer gene list to a
gene expression data set generated from a micro-array analysis comparing breast cancer and
normal breast tissue samples." (See page 408, right column). Therefore, Appellants first submit
that the reference by Hu et al. only studies the statistical analysis of micro-array data and not
gene amplification data. Thus their findings would not be directly applicable to gene 
amplification data. 

According to Hu et al., "different statistical methods" were applied to "estimate the
strength of gene-disease relationships and evaluated the results." (See page 406, left column,
emphasis added). Using these different statistical methods, Hu et al. "[a]ssessed the relative
strengths of gene-disease relationships based on the frequency of both co-citation and single
citation." (See page 411, left column). It is well known in the art that various statistical methods
allow different variables to be manipulated to affect the outcome. For example, the authors 
admit, "Initial attempts to search the literature using" the list of genes, gene names, gene 
symbols, and frequently used synonyms, generated by the authors "revealed several sources of 
false positives and false negatives." (See page 406, right column). The authors further admit 
that the false positives caused by "duplicative and unrelated meanings for the term" were 
"difficult to manage." Therefore, in order to minimize such false positives, Hu et al. disclose
that these terms "had to be eliminated entirely, thereby reducing the false positive rate but
unavoidably under-representing some genes." Id. Hence, Appellants respectfully submit that in
order to minimize the false positives and negatives in their analysis, Hu et al. manipulated
various aspects of the input data. 

Appellants further submit that the statistical analysis by Hu et al. is not a reliable standard
because the frequency of citation reflects only the current research interest of a molecule rather
than the true biological function of the molecule. Indeed, the authors acknowledge that
"[r]elationship established by frequency of co-citation do not necessarily represent a true
biological link." (See page 411, right column). One would expect that genes with the greatest
change in expression in a disease would be the first targets of research, and therefore have the
strongest known relationship to the disease as measured by the number of publications reporting
a connection with the disease. The correlation reported in Hu only indicates that the greater the 
change in expression level, the more likely it is that there is a published or known role for the 
gene in the disease, as found by their automated literature-mining software. Thus, Hu's results 
merely reflect a bias in the literature toward studying the most prominent targets, and say nothing 
regarding the ability of a gene that is 2-fold or more differentially expressed in tumors to serve as 
a disease marker. 

Even assuming that Hu et al. provide evidence to support a true relationship, the 
conclusion in Hu et al. only applies to a specific type of breast tumor (estrogen receptor
(ER)-positive breast tumor) and cannot be generalized as a principle governing microarray study of
breast cancer in general, let alone the various other types of cancer genes in general. In fact, even
Hu et al admit that, "[i]t is likely that this threshold will change depending on the disease as well 
as the experiment. Interestingly, the observed correlation was only found among ER-positive 
(breast) tumors not ER-negative tumors." (See page 412, left column). Therefore, based on 
these findings, the authors add, "This may reflect a bias in the literature to study the more 
prevalent type of tumor in the population. Furthermore, this emphasizes that caution must be 
taken when interpreting experiments that may contain subpopulations that behave very 
differently." Id. (Emphasis added). 

More importantly, Hu et al. did not look for a correlation between changes in mRNA and 
changes in protein levels, and therefore their results are not contrary to Appellants' assertion that 
there is a correlation between the two. Appellants are not relying on any "biological role" that 
the PR0274 polypeptide has in cancer for its asserted utility. Instead, Appellants are relying on
the amplification of the gene encoding PR0274 in certain tumors compared to their normal tissue 
counterparts. Nowhere in Hu does it say that a lack of correlation in their study means that genes 
with a less than five-fold change in level of expression in cancer cannot serve as a diagnostic 
marker of cancer.

Li et al. 

The Examiner also cites Li et al as teaching that "68.8% of the genes showing over- 
representation in the genome did not show elevated transcript levels." (Page 13 of the Office 
Action mailed November 1, 2006). Appellants respectfully point out that Li et al acknowledge
that their results differed from those obtained by Hyman et al and Pollack et al (of record), who 
found a substantially higher level of correlation between gene amplification and increased gene 
expression. The authors note that "[t]his discordance may reflect methodologic differences 
between studies or biological differences between breast cancer and lung adenocarcinoma" (page 
2629, col. 1). In fact, as explained in the Supplemental Information accompanying the Li article, 
genes were considered to be amplified if they had a copy number ratio of at least 1.40. As
discussed in Appellants' previous responses, and in the Goddard Declaration of record, an
appropriate threshold for considering gene amplification to be significant is a copy number of at
least 2.0. As discussed above, the PR0274 gene showed 2.0-fold to 3.05-fold amplification in
three different lung tumors, thus meeting this standard. It is not surprising that, by using a
substantially lower threshold for considering a gene to be amplified, Li et al would have
identified a number of genes that were not in fact significantly amplified, and therefore did not
show any corresponding increase in mRNA expression. The results of Li et al therefore do not
disprove that a gene with a substantially higher level of gene amplification, such as PR0274,
would be expected to show a corresponding increase in transcript expression. Appellants point
out that the Examiner has found these arguments persuasive as noted in the Advisory Action
mailed March 1, 2007.

Nagaraja et al., Waghray et al., and Sagynaliev et al.

In further support of the assertion that "changes in mRNA expression frequently do not 
result in changes in protein expression" (page 16 of the Office Action mailed November 1, 2006; 
emphasis in original), the Examiner cites three additional references, by Nagaraja et al, Waghray
et al, and Sagynaliev et al.

The Examiner cites Nagaraja et al as allegedly teaching that in comparisons of 
expression profiles for normal breast compared to breast cancer, "the proteomic profiles 
indicated altered abundance of fewer proteins as compared to transcript profiles." (Page 15 of 
the Office Action mailed November 1, 2006). 

Appellants respectfully submit that the fact that many more transcripts than proteins were 
found to be differentially expressed does not mean that most mRNA changes did not result in 
correlating protein changes, but merely reflects the fact that expression levels were only
measured at all for many fewer proteins than transcripts. In particular, the total number of
proteins whose expression levels could be visualized on silver-stained gels was only about 300 
(page 2332, col. 1), as compared to the approximately 14,500 genes on the microarray chips for 
which mRNA levels were measured (page 2336, col. 1). Since the expression levels of so many 
fewer proteins than transcripts were measured, it is hardly surprising that a smaller absolute 
number of proteins than mRNAs were found to be overexpressed, because the protein products 
of most of the overexpressed mRNAs would not have been among the small number of proteins 
identified on the gels. 

The Examiner next cites Waghray et al., to the effect that "for most of the proteins
identified, there was no appreciable concordant change at the RNA level" (Page 15 of the Office 
Action mailed November 1, 2006). 

Appellants emphasize that Appellants are asserting that a measurable change in mRNA 
level generally leads to a corresponding change in the level of protein expression, not that 
changes in protein level can be used to predict changes in mRNA level. Waghray et al. did not
take genes which showed significant mRNA changes and check the corresponding protein levels.
Instead, the authors looked at a small and unrepresentative number of proteins, and checked the
corresponding mRNA levels. Waghray et al. acknowledge that only "[a] relatively small set of
genes could be analyzed at the protein level, largely due to the limited sensitivity of 2-D PAGE"
(page 1337, col. 1). In particular, while the authors examined the expression levels of 16,570
genes (page 1329, col. 2), they were able to measure the expression levels of only 1031 proteins
(page 1333, col. 2). Waghray et al. does not teach that changes in mRNA expression were not
correlated with changes in expression of the corresponding protein. All Waghray et al. state is
that "for most of the proteins identified, there was no appreciable concordant change at the
mRNA level" (page 1337, col. 2). This statement is not relevant to Appellants' assertion of
utility, since Appellants are not asserting that changes in mRNA levels are the only cause of
changes in protein levels. Waghray et al. do not contradict Appellants' assertion that changes in
mRNA expression, in general, correspond to changes in expression of the corresponding protein. 
Of note, the Examiner has withdrawn the rejection based on Waghray et al. in the Advisory 
Action of March 1, 2007.

Lastly, the Examiner cites Sagynaliev et al, as allegedly teaching that "it is also difficult
to reproduce transcriptomics results with proteomics tools." In particular, the Examiner notes
that according to Sagynaliev et al, of 982 genes found to be differentially expressed in human
CRC, only 177 (18%) have been confirmed using proteomics technologies. (Page 16 of the
Office Action mailed November 1, 2006).

The Sagynaliev et al reference, titled "Web-based data warehouse on gene expression in
human colorectal cancer" (emphasis added), drew conclusions based upon a literature survey of
gene expression data published in human CRC, and not from experimental data. While a literature
survey can be a useful tool to assist researchers, the results may greatly over-represent or under- 
represent certain genes, and thus the conclusions may not be generally applicable. In particular, 
Appellants note that, as evidenced by Nagaraja et al and Waghray et al, discussed above, the 
number of mRNAs examined in transcriptomics studies is typically much larger than the number of
proteins examined in corresponding proteomics studies, due to the difficulties in detecting and
resolving more than a small minority of all expressed proteins on 2D gels. Thus, the fact that only
18% of all genes found to be differentially expressed in human CRC have been confirmed using
proteomics technologies does not mean that the corresponding proteins are not also differentially 
expressed, but is most likely due to the fact that the corresponding proteins were not identified on 2D 
gels, and thus their expression levels remain unknown. 

The authors of Sagynaliev et al acknowledge the many technical problems in finding 
proteomic data for CRC that can be matched to transcriptomic data to see if the two correlate. The 
authors state that "results have been obtained using heterogeneous samples in particular cell lines, 
whole tissue biopsies, and epithelial cells purified from surgical specimens." However, "Results 
obtained in cell lines do not allow accurate comparison between normal and cancer cells, and the 
presence/absence of proteins of interest has to be confirmed in biopsies." (Page 3072, left column.) 
In particular, the authors specifically note that "only a single study [1] provided differential display 
protein expression data obtained in the human patient, using whole tissue biopsy." (Page 3068, left
column, second paragraph; see also, Table 2.) As the Examiner also notes, the authors state, "For
CRC, there is no publication comparing mRNA and protein expression for a cohort of genes." (Page 
3077, left column, last paragraph, emphasis added.) 

Appellants further note that Table 2 shows that 6 out of 8 published proteomics studies were 
done using 2-D PAGE. However, the authors state that "2-D PAGE or 2-D DIGE have well-known
technological limitations . . . even under well-defined experimental conditions, 2-D PAGE parallel 
analysis of paired CRC samples is hampered by a significant variability." (Page 3077, left column, 
third paragraph.) Therefore, Appellants respectfully submit that it is well known in the art that there 
are problems associated with selecting only those proteins detectable by 2D gels. 

The Examiner further asserts that "the specification of the instant application does not 
teach a change in mRNA level of PR0274" because "[t]here are no teachings in the specification 
as to the differential expression of PR0274 mRNA in the progression of lung cancer or in 
response to different treatments of hormones (for example)." (Page 16 of the Office Action 
mailed November 1, 2006). Appellants respectfully note that the instant specification measured 
gene amplification, not mRNA expression. Appellants further submit that it is well known that 
cancers arise from the transformation of normal tissue cells to cancerous cells, thus the observed 
differences in gene amplification between normal and cancerous tissues are in fact the result of 
previously occurring changes.

Finally, in the Advisory Action of March 1, 2007, the Examiner notes that Sagynaliev 
points out that many genes found to be differentially regulated do not play a causal role in CRC 
carcinogenesis. 

The Examiner appears to be concerned with the underlying mechanism resulting in the 
positive gene amplification results, and not with those results themselves. However, the 
Examiner's concerns regarding the causal role of PR0274 associated with any type of cancer 
versus normal tissue, in no way negate the utility of the claimed invention. Appellants are 
relying on diagnostic utility and not therapeutic utility. The fact remains that the gene 
amplification results demonstrate overexpression of PR0274 in the named tumor. One of 
ordinary skill in the art does not need to know the underlying mechanism of the overexpression
of PR0274, or the downstream effects of that overexpression, to practice the diagnostic utility.
One of ordinary skill in the art, in possession of these results, would have believed it more likely
than not that the PR0274 polypeptide and the antibodies that bind it were useful for their
asserted utility. 

In summary, Appellants respectfully submit that the Examiner has not shown that a 
change in gene expression level in tumor as compared to normal tissue is not correlated with a 
change in protein expression. The Patent Office has failed to meet its initial burden of proof that
Appellants' claims of utility are not substantial or credible. The arguments presented by the
Examiner in combination with the Pennica et al, Gygi et al and Hu et al articles, as well as 
those by Li et al, Nagaraja et al, Waghray et al, and Sagynaliev et al, do not provide sufficient 
reasons to doubt the statements by Appellants that PR0274 has utility. As discussed above, the 
law does not require that gene amplification "necessarily" results in increased expression at the 
mRNA and polypeptide levels. Therefore, Appellants submit that the Examiner's reasoning is 
based on a misrepresentation of the scientific data presented in the above cited references and 
application of an improper, heightened legal standard. In fact, contrary to what the Examiner 
contends, the art indicates that if a gene is amplified in cancer, it is more likely than not that the 
encoded protein will be expressed at an elevated level. 

iv) It is "more likely than not" for amplified genes to have increased mRNA and 
protein levels 

Appellants have submitted ample evidence to show that, in general, if a gene is amplified 
in cancer, it is more likely than not that the encoded protein will be expressed at an elevated 
level. First, the articles by Orntoft et al, Hyman et al, and Pollack et al, (made of record in 
Appellants' Response filed September 14, 2004) collectively teach that in general gene 
amplification increases mRNA expression.

Second, Appellants have submitted over a hundred references, along with Declarations of
Dr. Paul Polakis with their Preliminary Amendment filed on August 3, 2006, which collectively
teach that, in general, there is a correlation between mRNA levels and polypeptide levels.

Third, Appellants would like to bring to the Examiner's attention a recent decision by the 
Board of Patent Appeals and Interferences (Decision on Appeal No. 2006-1469). In its decision, 
the Board reversed the utility rejection, acknowledging that "there is a strong correlation between 
mRNA levels and protein expression, and the Examiner has not presented any evidence specific 
to the PRO 1866 polypeptide to refute that." (Page 9 of the Decision). Appellants submit that, in 
the instant application, the Examiner has likewise not presented any evidence specific to the 
PR0274 polypeptide to refute Appellants' assertion of a correlation between mRNA levels and 
protein expression. 
Thus, taken together, all of the submitted evidence supports Appellants' position that gene
amplification is more likely than not predictive of increased mRNA and polypeptide levels. 

Appellants submit that there are numerous articles which show that generally, if a gene is 
amplified in cancer, it is more likely than not that the mRNA transcript will be expressed at an 
elevated level. For example, Orntoft et al. (Mol. and Cell. Proteomics, 2002, vol. 1, pages 37-45 
- made of record in Appellants' Response filed September 14, 2004) studied transcript levels of 
5600 genes in malignant bladder cancers, many of which were linked to the gain or loss of 
chromosomal material using an array-based method. Orntoft et al showed that there was a gene 
dosage effect and taught that "in general (18 of 23 cases) chromosomal areas with more than 2- 
fold gain of DNA showed a corresponding increase in mRNA transcripts" (see column 1, 
abstract). In addition, Hyman et al (Cancer Res., 2002, vol. 62, pages 6240-45 - made of record
in Appellants' Response filed September 14, 2004) showed, using CGH analysis and cDNA 
microarrays which compared DNA copy numbers and mRNA expression of over 12,000 genes in 
breast cancer tumors and cell lines, that there was "evidence of a prominent global influence of 
copy number changes on gene expression levels." (See page 6244, column 1, last paragraph). 
Additional supportive teachings were also provided by Pollack et al. (PNAS, 2002, vol. 99, 
pages 12963-12968 - made of record in Appellants' Response filed September 14, 2004), who 
studied a series of primary human breast tumors and showed that "...62% of highly amplified 
genes show moderately or highly elevated expression, and DNA copy number influences gene 
expression across a wide range of DNA copy number alterations (deletion, low-, mid- and high- 
level amplification), and that on average, a 2-fold change in DNA copy number is associated 
with a corresponding 1.5-fold change in mRNA levels." Thus, these articles collectively teach 
that in general gene amplification increases mRNA expression. 

Upon consideration of Appellants' arguments made in the response of February 1, 2007, 
the Examiner states in the Advisory Action mailed March 1, 2007 that the articles by Orntoft et 
al., Hyman et al., and Pollack et al. "are convincing that gene amplification generally results in 
increased transcription." 

In addition, in their Responses filed September 14, 2004 and August 3, 2006, Appellants 
submitted two Declarations by Dr. Polakis, principal investigator of the Tumor Antigen Project of 
Genentech, Inc., the assignee of the present application, to show that mRNA expression 
correlates well with protein levels, in general. As Dr. Polakis explains, the primary focus of the 
microarray project was to identify tumor cell markers useful as targets for both the diagnosis and 
treatment of cancer in humans. The Declaration by Dr. Paul Polakis (Polakis I - made of record 
in Appellants' Response filed September 14, 2004) explains that in the course of Dr. Polakis' 
research using microarray analysis, he and his co-workers identified approximately 200 gene 
transcripts that are present in human tumor cells at significantly higher levels than in 
corresponding normal human cells. Appellants submit that Dr. Polakis' Declaration was 
presented to support the position that there is a correlation between mRNA levels and 
polypeptide levels. The second Declaration by Dr. Polakis (Polakis II - made of record in 
Appellants' Response filed August 3, 2006) presented evidentiary data in Exhibit B. Exhibit B of 
the Declaration identified 28 gene transcripts out of 31 gene transcripts (i.e., greater than 90%) 
that showed good correlation between tumor mRNA and tumor protein levels. As Dr. Polakis' 
Declaration (Polakis II) says, "[a]s such, in the cases where we have been able to quantitatively 
measure both (i) mRNA and (ii) protein levels in both (i) tumor tissue and (ii) normal tissue, we 
have observed that in the vast majority of cases, there is a very strong correlation between 
increases in mRNA expression and increases in the level of protein encoded by that mRNA." 
Accordingly, Dr. Polakis has provided the facts to enable the Examiner to draw independent 
conclusions regarding protein data. Appellants further emphasize that the opinions expressed in 
the Polakis Declaration, including in the above quoted statement, are all based on factual 
findings. For instance, antibodies binding to about 30 of these tumor antigens were prepared and 
mRNA and protein levels were compared. In approximately 80% of the cases, the researchers 
found that increases in the level of a particular mRNA correlated with changes in the level of 
protein expressed from that mRNA when human tumor cells are compared with their 
corresponding normal cells. Therefore, Dr. Polakis' research, which is referenced in his 
Declaration, shows that, in general, there is a correlation between increased mRNA and 
polypeptide levels. Hence, one of skill in the art would reasonably expect that, based on the gene 
amplification data of the PR0274 gene, the PR0274 polypeptide is concomitantly overexpressed 
in lung tumors studied as well. Based on these experimental data and his vast scientific 
experience of more than 20 years, Dr. Polakis states that, for human genes, increased mRNA 
levels typically correlate with an increase in abundance of the encoded protein. He further 
confirms that "it remains a central dogma in molecular biology that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip arrays in 2004. Clearly, the research community believes that the 
information obtained from these chips is useful (i.e., that it is more likely than not informative of 
the protein level). 

Taken together, although there are some examples in the scientific art that do not fit 
within the central dogma of molecular biology that there is a correlation between polypeptide and 
mRNA levels, these instances are exceptions rather than the rule. In the majority of amplified 
genes, the teachings in the art, as exemplified by Orntoft et al., Hyman et al., Pollack et al., and 
the Polakis Declaration, overwhelmingly show that gene amplification influences gene 
expression at the mRNA and protein levels. Thus, one of skill in the art would reasonably expect 
in this instance, based on the amplification data for the PR0274 gene, that the PR0274 
polypeptide is concomitantly overexpressed. Accordingly, Appellants submit that the PR0274 
polypeptides and antibodies have utility in the diagnosis of cancer and based on such a utility, 
one of skill in the art would know exactly how to use the claimed antibodies for diagnosis of 
cancer. 

With respect to the correlation between mRNA expression and protein levels, the 
Examiner asserts that the Polakis Declaration (Polakis I) is insufficient to overcome the rejection 
of claims 58-62 since it is limited to a discussion of data regarding the correlation of mRNA 
levels and polypeptide levels and not gene amplification levels. The Examiner asserts that the 
Declaration does not provide data such that the Examiner can independently draw conclusions. 
(Page 7 of the Office Action mailed July 19, 2005). 

Appellants submit that Dr. Polakis' Declaration was presented to support the position that 
there is a correlation between mRNA levels and polypeptide levels, the correlation between gene 
amplification and mRNA levels having already been established by the data shown in the Orntoft 
et al., Hyman et al., and Pollack et al. articles. Appellants emphasize that the opinions expressed 
in the Polakis Declaration, including the quoted statement, are all based on factual findings. 
Subsequently, antibodies binding to about 30 of these tumor antigens were prepared, and mRNA 
and protein levels were compared. In approximately 80% of the cases, the researchers found that 
increases in the level of a particular mRNA correlated with changes in the level of protein 
expressed from that mRNA when human tumor cells are compared with their corresponding 
normal cells. Dr. Polakis' statement that "an increased level of mRNA in a tumor cell relative to 
a normal cell typically correlates to a similar increase in abundance of the encoded protein in the 
tumor cell relative to the normal cell" is based on factual experimental findings, clearly set forth 
in the Declaration. Accordingly, the Declaration is not merely conclusive, and the fact-based 
conclusions of Dr. Polakis would be considered reasonable and accurate by one skilled in the art. 

The case law has clearly established that in considering affidavit evidence, the Examiner 
must consider all of the evidence of record anew. 20 "After evidence or argument is submitted by 
the Appellant in response, patentability is determined on the totality of the record, by a 
preponderance of the evidence with due consideration to persuasiveness of argument." 21 
Furthermore, the Federal Court of Appeals held in In re Alton, "[W]e are aware of no reason why 
opinion evidence relating to a fact issue should not be considered by an examiner." 22 Appellants 
also respectfully draw the Examiner's attention to the Utility Examination Guidelines 23 which 
state, "Office personnel must accept an opinion from a qualified expert that is based upon 
relevant facts whose accuracy is not being questioned; it is improper to disregard the opinion 
solely because of a disagreement over the significance or meaning of the facts offered." 
The statement in question from an expert in the field (the Polakis Declaration) states that "it is 
my considered scientific opinion that for human genes, an increased level of mRNA in a tumor 
cell relative to a normal cell typically correlates to a similar increase in abundance of the encoded 
protein in the tumor cell relative to the normal cell." Therefore, barring evidence to the contrary 
regarding the above statement in the Polakis Declaration, this rejection is improper under both 
the case law and the Utility guidelines. 

20 In re Rinehart, 531 F.2d 1084, 189 U.S.P.Q. 143 (C.C.P.A. 1976); In re Piasecki, 745 
F.2d 1015, 226 U.S.P.Q. 881 (Fed. Cir. 1985). 

21 In re Alton, 37 U.S.P.Q.2d 1578, 1584 (Fed. Cir. 1996) (quoting In re Oetiker, 977 
F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 1992)). 

22 Id. at 1583. 

23 Part IIB, 66 Fed. Reg. 1098 (2001). 

The Examiner asserts that the second Polakis Declaration (Polakis II) is insufficient to 
overcome the utility rejection because PR0274 does not appear in the table (Exhibit B), and 
allegedly it is not clear whether PR0274 shares the same characteristics as those tumor antigens 
tested. (Page 5 of the Office Action mailed November 1, 2006). 

Appellants respectfully submit that, as discussed in their previous Responses and Appeal 
Brief, the standard for utility is more likely than not. Dr. Polakis' Declarations provide evidence, 
in the form of statements by an expert in the art, that "an increased level of mRNA in a tumor 
cell relative to a normal cell typically correlates to a similar increase in abundance of the encoded 
protein in the tumor cell relative to the normal cell." The PR0274 gene was found to be 
amplified in lung tumors. As discussed above and in Appellants' previous Responses, one of 
ordinary skill in the art would therefore expect the PR0274 mRNA to be overexpressed in the 
same human lung tumor samples. Accordingly, one of ordinary skill in the art would understand 
that the PR0274 polypeptide would be expected (more likely than not) to be overexpressed in 
human lung tumor samples relative to their normal human tissue counterparts, as are the 
majority of other molecules tested. 

The Examiner further states that "levels of mRNA and protein in tumor tissue were 
compared to corresponding normal tissue, but the amplification levels of genomic DNA from 
example 114 were compared to normal human blood, not corresponding normal tissue." (Page 5 
of the Office Action mailed November 1, 2006). Appellants respectfully note that the Polakis 
Declaration describes the results of microarray experimentation, while Example 114 of the 
specification discloses gene amplification data. Thus, the Examiner's attempt to contrast the 
methodology of the two types of experiments is misplaced. 

The Examiner appears to require Appellants to provide every single experimental detail 
involved in the testing of the mRNA/protein correlation according to the Polakis Declaration. 
Such a requirement is unreasonable because neither the law nor the Utility Guidelines requires 
Appellants to do so. 

The Examiner further notes (at page 6 of the Office Action mailed November 1, 2006) 
that Dr. Polakis is employed by the assignee. Appellants respectfully submit that the sworn 
Declaration of Dr. Polakis is sufficient to support Appellants' position of a general mRNA/protein 
correlation, even if Dr. Polakis is an employee of the assignee. 

Based on the above arguments, Appellants have clearly demonstrated a credible, specific 
and substantial asserted utility for the PR0274 polypeptide and the claimed antibodies that bind 
it, for example, as diagnostic markers for lung tumors. Further, based on this utility and the 
disclosure in the specification, one skilled in the art at the time the application was filed would 
know how to use the claimed antibodies. 

With respect to the over one hundred additional references supporting a correlation 
between mRNA and protein overexpression cited in Appellants' Preliminary Amendment filed 
August 3, 2006, the Examiner asserts that "[w]ith the exception of Futcher et al, all of 
Appellant's newly cited references are directed to the analysis of single genes, or a small group 
of genes, and therefore do not demonstrate trends found across proteins in general." (Page 8 of 
the Office Action mailed November 1, 2006). 

Appellants note that the submitted references, which represent experiments conducted by 
a large number of different research groups, demonstrate a trend of correlation found across 
proteins in general and that this trend is confirmed by an overwhelming number of experiments 
by different researchers, using diverse experimental designs, testing various types of tissues, 
under numerous biological conditions. Although only a single gene or a small group of genes 
was tested by each individual study group, the cumulative evidence generated by over one 
hundred study groups certainly establishes that it is well-accepted in the art that a general 
mRNA/protein correlation exists. 

In response to the submitted textbook excerpts by Alberts and Lewin (submitted in the 
IDS filed on August 3, 2006 in Exhibit 1), the Examiner acknowledges that the teachings of 
Alberts and Lewin support that the initiation of transcription is the most common point for a cell 
to regulate gene expression. The Examiner asserts, however, that the initiation of transcription 
"is not the only means of regulating gene expression" according to the teaching of Alberts. (Page 
7 of the Office Action mailed November 1, 2006). 

Appellants respectfully submit that the utility standard is not absolute certainty. Rather, 
to overcome the presumption of truth that an assertion of utility by an Appellant enjoys, the PTO 
must establish that it is more likely than not that one of ordinary skill in the art would doubt the 
truth of the statement of utility. Therefore, Appellants do not need to establish that transcription 
initiation is the only means of regulating gene expression in order to meet the utility standard. 
Instead, as long as it is the most common point of regulation, as admitted by the Examiner, it 
would be more likely than not that a change in the transcription level of a gene gives rise to a 
change in translation level of a gene. Appellants note that both Alberts and Lewin make clear 
that it is far more likely than not that protein levels for any given gene are regulated at the 
transcriptional level. Alberts, for example, states that of all the possible points for regulating 
protein expression, "for most genes transcriptional controls are paramount." Cell 4th at 379 
(emphasis added). In a similar vein, Lewin states that "having acknowledged that control of gene 
expression can occur at multiple stages, and that production of RNA cannot inevitably be equated 
with production of protein, it is clear that the overwhelming majority of regulatory events occur 
at the initiation of transcription." Genes VI at 847-848 (emphasis added). Thus, the utility 
standard is met. 

With respect to Appellants' arguments regarding Meric et al. (submitted in the IDS filed 
on August 3, 2006 in Exhibit 1), the Examiner asserts that Meric teaches that "gene expression is 
quite complicated, and is also regulated at the level of mRNA stability, mRNA translation, and 
protein stability." (Page 8 of the instant Office Action). 

Appellants respectfully submit that Meric simply summarizes the translational regulation 
of cancer cells. Meric indicates that translation initiation is regulated in response to nutrient 
availability and mitogenic stimulation and is coupled to cell cycle progression and cell growth. 
Meric further discusses how alterations in translation control occur in cancer. For example, 
variant mRNA sequences can alter the translational efficiency of individual mRNA molecules 
(see Abstract). Meric further teaches that the changes in translational efficiency of an mRNA 
transcript depend on the mutation of a specific mRNA sequence. (Page 973, column 2 to 
page 974, column 1). Meric never suggests that the translation of a cancer gene is suppressed in 
cancer in general and that therefore, increased mRNA levels will not, in general, yield increased 
protein levels. To the contrary, Meric teaches that the translation efficiency of a number of 
cancer genes is enhanced in cancer cells compared to their normal counterparts. For instance, in 
patients with multiple myeloma, a C-T mutation in the c-myc IRES was identified and found to 
cause an enhanced initiation of translation (page 974, column 1). Therefore, the level of proteins 
encoded by these genes increases in cancer cells at an even higher magnitude than the 
corresponding mRNA level. Thus, Meric clearly supports Appellants' assertions that it is more 
likely than not that, in general, changes in mRNA levels are correlated with changes in protein 
levels. 

The Examiner further alleges that Meric teaches "translational efficiency of a number of 
cancer genes is enhanced in cancer cells compare to its normal counterpart due to mutation" and 
from this asserts that "the specification does not teach that PR0274 mRNA in cancer cells have a 
mutation that would lead to increased translation." (Advisory Action mailed March 1, 2007). 

Appellants respectfully point out that Meric simply summarizes translational regulatory 
mechanisms in cancer cells and discusses how alterations in different aspects of translation 
control occur in cancer. Meric indicates that translation initiation is regulated in response to 
nutrient availability and mitogenic stimulation and is coupled to cell cycle progression and cell 
growth. Meric does not teach that mutation of a gene is required for its increased translation in 
cancer. Appellants do not need to establish the mechanism of regulating PR0274 gene 
expression in order to meet the utility standard. 

The Examiner asserts that "the majority of the newly cited references by Appellants are 
drawn to genes known or suspected to be over expressed or under expressed in cancers, and that 
are involved with cell proliferation, differentiation and/or cell adhesion/migration, in which 
expression of the protein is important in the development and progression of the cancer." (Pages 
8-9 of the Office Action mailed November 1, 2006). 

Appellants respectfully submit that, in fact, a number of the references submitted with 
Appellants' IDS filed August 3, 2006, are drawn to proteins that are not members of the above 
protein categories and have no obvious association with cancer. To list just a few examples, 
Rudlowski et al. examined the expression of glucose transporters 1-4; Papotti et al. studied three 
somatostatin receptors; Van der Wilt et al. studied deoxycytidine kinase; and Grenback et al. 
studied galanin. 

Appellants further respectfully submit that, as discussed in their previous Responses and 
Appeal Brief, Orntoft et al., Hyman et al., and Pollack et al. (made of record in Appellants' 
Response filed September 14, 2004), collectively teach that gene amplification increases mRNA 
expression for large numbers of genes, which have not been identified as being oncogenes or as 
having any known functions in the development and progression of cancer. Thus the art of 
record clearly shows that there is no requirement that a polypeptide must be a known oncogene 
or a protein otherwise known to be associated with tumor growth, in order for amplification of 
the gene encoding the protein to correlate with increased protein expression. In fact, as 
demonstrated by Orntoft et al., Hyman et al., and Pollack et al., examination of gene 
amplification is a useful way to identify novel proteins not previously known to be associated 
with cancer. 

Taken together, although there are some examples in the scientific art that do not fit 
within the central dogma of molecular biology that there is a correlation between polypeptide and 
mRNA levels, these instances are exceptions rather than the rule. In the majority of amplified 
genes, the teachings in the art, as exemplified by Orntoft et al., Hyman et al., Pollack et al., and 
the Polakis Declarations, overwhelmingly show that gene amplification influences gene 
expression at the mRNA and protein levels. Therefore, one of skill in the art would reasonably 
expect in this instance, based on the amplification data for the PR0274 gene, that the PR0274 
polypeptide is concomitantly overexpressed. Thus, Appellants submit that the PR0274 
polypeptide and the claimed antibodies that bind it have utility in the diagnosis of cancer. 

v) Even if a prima facie case of lack of utility has been established, it should be 
withdrawn on consideration of the totality of evidence 

Even if one assumes arguendo that it is more likely than not that there is no correlation 
between gene amplification and increased mRNA/protein expression, which Appellants submit is 
not true, a polypeptide encoded by a gene that is amplified in cancer would still have a specific, 
substantial, and credible utility. In support, Appellants respectfully draw the Board's attention to 
page 2 of the Declaration of Dr. Avi Ashkenazi (submitted with the Response filed September 
14, 2004) which explains that, 

even when amplification of a cancer marker gene does not result in significant 
over-expression of the corresponding gene product, this very absence of gene 
product over-expression still provides significant information for cancer diagnosis 
and treatment. Thus, if over-expression of the gene product does not parallel gene 
amplification in certain tumor types but does so in others, then parallel monitoring 
of gene amplification and gene product over-expression enables more accurate 
tumor classification and hence better determination of suitable therapy. In 
addition, absence of over-expression is crucial information for the practicing 
clinician. If a gene is amplified but the corresponding gene product is not 
over-expressed, the clinician accordingly will decide not to treat a patient with agents 
that target that gene product. 

Appellants thus submit that simultaneous testing of gene amplification and gene product 
over-expression enables more accurate tumor classification, even if the gene-product, the protein, 
is not over-expressed. This leads to better determination of a suitable therapy. Further, as 
explained in Dr. Ashkenazi's Declaration, absence of over-expression of the protein itself is 
crucial information for the practicing clinician. If a gene is amplified in a tumor, but the 
corresponding gene product is not over-expressed, the clinician will decide not to treat a patient 
with agents that target that gene product. This not only saves money, but also has the benefit that 
the patient can avoid exposure to the side effects associated with such agents. 

This utility is further supported by the teachings of the article by Hanna and Mornin 
(Pathology Associates Medical Laboratories, August (1999); submitted with the Response filed 
September 14, 2004). The article teaches that the HER-2/neu gene has been shown to be 
amplified and/or over-expressed in 10%-30% of invasive breast cancers and in 40%-60% of 
intraductal breast carcinomas. Further, the article teaches that diagnosis of breast cancer includes 
testing both the amplification of the HER-2/neu gene (by FISH) as well as the over-expression of 
the HER-2/neu gene product (by IHC). Even when the protein is not over-expressed, the assay 
relying on both tests leads to a more accurate classification of the cancer and a more effective 
treatment of it. 

The Examiner asserts that "[t]he Hanna reference is not applicable to the instant fact 
situation, as it deals with a known tumor associated gene, and not with a prospective analysis of 
the type found in this specification." (Page 9 of the Office Action mailed July 19, 2005). To the 
contrary, Appellants have clearly shown that the gene encoding the PR0274 polypeptide is 
amplified in at least three primary lung tumors. Therefore, the PR0274 gene, similar to the 
HER-2/neu gene disclosed in Hanna et al., is a tumor associated gene. Furthermore, as discussed 
above, in the majority of amplified genes, the teachings in the art overwhelmingly show that gene 
amplification influences gene expression at the mRNA and protein levels. Therefore, one of skill 
in the art would reasonably expect in this instance, based on the amplification data for the 
PR0274 gene, that the PR0274 polypeptide is concomitantly overexpressed. 
However, even if gene amplification does not result in overexpression of the gene product 
(i.e., the protein), an analysis of the expression of the protein is useful in determining the course 
of treatment, as supported by the Ashkenazi Declaration. The Examiner has asserted that "the 
gene product of the instant invention has not been demonstrated to be involved in cancer. Over- 
expression of a gene product in a cancer cell does not necessarily mean that the gene product is 
involved in the cancer and that targeting the gene product would be therapeutic." (Page 9 of the 
Office Action mailed July 19, 2005). The Examiner appears to view the testing described in the 
Ashkenazi Declaration and the Hanna paper as experiments involving further characterization of 
the PR0274 polypeptide itself. In fact, such testing is for the purpose of characterizing not the 
PR0274 polypeptide, but the tumors in which the gene encoding PR0274 is amplified. Testing 
of tumor markers such as PR0274 is useful for tumor categorization even if the tested marker is 
not itself the intended therapeutic target. The PR0274 polypeptide is therefore useful in tumor 
categorization, the results of which become an important tool in the hands of a physician 
enabling the selection of a treatment modality that holds the most promise for the successful 
treatment of a patient. 

For the reasons given above, Appellants respectfully submit that the present specification 
clearly describes, details and provides a patentable utility for the claimed invention. 
Accordingly, Appellants respectfully request reconsideration and reversal of the rejections of 
Claims 58-62 under 35 U.S.C. §101. 

Issue II: Claims 58-62 satisfy the enablement requirement of 35 U.S.C. §112, first 
paragraph. 

Claims 58-62 stand rejected under 35 U.S.C. §112, first paragraph, allegedly "since the 
claimed invention is not supported by either a specific and substantial asserted utility or a well 
established utility for the reasons set forth above, one skilled in the art clearly would not know how 
to use the claimed invention." (Page 9 of the Office Action mailed July 19, 2005). 

In this regard, Appellants refer to the arguments and information presented above in 
response to the outstanding rejection under 35 U.S.C. § 101, wherein those arguments are 
incorporated by reference herein. Appellants respectfully submit that as described above, the 
PR0274 polypeptide has utility in the diagnosis of cancer and based on such a utility, one of skill 
in the art would know exactly how to use the claimed antibodies that bind the PR0274 
polypeptide for diagnosis of cancer, without undue experimentation. 

Accordingly, Appellants respectfully request reconsideration and reversal of the 
enablement rejection of Claims 58-62 under 35 U.S.C. §112, first paragraph. 

Issue III: Claims 58-62 are patentable under 35 U.S.C. §102(b) over Ho et al. 

Claims 58-62 stand rejected under 35 U.S.C. § 102(b) as being anticipated by Ho et al., 
Science, Vol. 289, pp 265-270, published July 14, 2000. 

Appellants submit that, as discussed above in response to the outstanding rejections under 
35 U.S.C. §101 and 35 U.S.C. §112, first paragraph, for alleged lack of utility and enablement, 
Appellants rely on the gene amplification results (Example 114) to establish a credible, 
substantial and specific asserted utility for the PR0274 polypeptide and the claimed antibodies 
that bind it. These results were first disclosed in International Application No. 
PCT/US00/03565, filed February 11, 2000. As discussed above, the disclosure of the instant 
application, which is similar to that of the earlier-filed application (PCT/US00/03565), provides 
the support required under 35 U.S.C. §112 for the subject matter of the instant claims. 
Accordingly, Appellants submit that the subject matter of the instant claims is disclosed in the 
manner provided by 35 U.S.C. §112 in PCT/US00/03565. Therefore, the effective filing date of 
this application is February 11, 2000, the filing date of PCT/US00/03565. 

The scientific journal article by Ho et al. was published on July 14, 2000, which is over 
five months after the effective filing date of the instant application; hence Ho et al. is not prior 
art. 

The Examiner has asserted that the present claims are not entitled to the February 11, 
2000, filing date of PCT/US00/03565 because "the gene amplification assay fails to disclose a 
patentable utility for the antibodies to the protein." (Page 10 of the Office Action mailed July 19, 
2005). 

In this regard, Appellants refer to the arguments and information presented above in 
response to the outstanding rejections under 35 U.S.C. §101 and 35 U.S.C. §112, first paragraph, 
for alleged lack of utility and enablement. These arguments are incorporated by reference herein. 
Appellants respectfully submit that as described above under Issue I, the presently claimed 
invention is supported by a specific, substantial and credible utility and, therefore, the present 
specification teaches one of ordinary skill in the art "how to use" the claimed invention without 
undue experimentation, as described above. 

Accordingly, Appellants respectfully request reconsideration and reversal of the rejection 
of Claims 58-62 under 35 U.S.C. § 102(b) as being anticipated by Ho et al. 

Issue IV: Claims 59-62 are patentable under 35 U.S.C. §103(a) over Ho et al. in view 
of Janeway et al. 

Claims 59-62 stand rejected under 35 U.S.C. §103(a) as being unpatentable over Ho et al. 
in view of Immunology, The Immune System in Health and Disease, Third Edition, Janeway and 
Travers, Ed., 1997. 

As discussed above, the effective filing date of this application is February 11, 2000, the 
filing date of PCT/US00/03565. The scientific journal article by Ho et al. was published on July 
14, 2000, which is over five months after the effective filing date of the instant application; hence 
Ho et al. is not prior art, and is not available as a reference under 35 U.S.C. §103. 

Accordingly, Appellants respectfully request reconsideration and reversal of the rejection 
of Claims 59-62 under 35 U.S.C. §103(a). 

CONCLUSION 

For the reasons given above, Appellants submit that the specification discloses at least 
one patentable utility for the antibodies of Claims 58-62, and that one of ordinary skill in the art 
would understand how to use the claimed antibodies, for example in the diagnosis of lung 
tumors. Therefore, Claims 58-62 meet the requirements of 35 U.S.C. §101 and 35 U.S.C. §112, 
first paragraph. 

Further, this patentable utility for the claimed antibodies was first disclosed in 
International Application No. PCT/US00/03565, filed February 11, 2000, priority to which is 
claimed in the instant application. Accordingly, the instant application has an effective priority 
date of February 11, 2000, and therefore Ho et al., Science, Vol. 289, pp. 265-270, published on 
July 14, 2000, is not prior art and does not anticipate the claims under 35 U.S.C. §102(b) or 
render the claims obvious under 35 U.S.C. §103(a) in view of Janeway et al. 

Accordingly, reversal of all the rejections of Claims 58-62 is respectfully requested. 

Please charge any additional fees, including fees for additional extension of time, or credit 
overpayment to Deposit Account No. 08-1641 (referencing Attorney's Docket No. 39780-2630 
P1C9). 

Respectfully submitted, 



Date: November 1, 2007 By: 

HELLER EHRMAN LLP 

275 Middlefield Road 




Menlo Park, California 94025-3506 
Telephone: (650) 324-7000 
Facsimile: (650) 324-0638 

8. CLAIMS APPENDIX A 

Claims on Appeal 

58. An isolated antibody that specifically binds to the polypeptide of SEQ ID NO:7. 

59. The antibody of Claim 58 which is a monoclonal antibody. 

60. The antibody of Claim 58 which is a humanized antibody. 

61. An antigen binding fragment of the antibody of Claim 58. 

62. The antibody of Claim 58 which is labeled. 

9. EVIDENCE APPENDIX 

1. Declaration of Audrey Goddard, Ph.D. under 37 C.F.R. §1.132, with attached Exhibits A-G: 

A. Curriculum Vitae of Audrey D. Goddard, Ph.D. 

B. Higuchi, R. et al., "Simultaneous amplification and detection of specific 
DNA sequences," Biotechnology 10:413-417 (1992). 

C. Livak, K.J., et al., "Oligonucleotides with fluorescent dyes at opposite 
ends provide a quenched probe system useful for detecting PCR product 
and nucleic acid hybridization," PCR Methods Appl. 4:357-362 (1995). 

D. Heid, C.A. et al., "Real time quantitative PCR," Genome Res. 6:986-994 
(1996). 

E. Pennica, D. et al., "WISP genes are members of the connective tissue 
growth factor family that are up-regulated in Wnt-1-transformed cells and 
aberrantly expressed in human colon tumors," Proc. Natl. Acad. Sci. USA 
95:14717-14722 (1998). 

F. Pitti, R.M. et al., "Genomic amplification of a decoy receptor for Fas 
ligand in lung and colon cancer," Nature 396:699-703 (1998). 

G. Bieche, I. et al., "Novel approach to quantitative polymerase chain reaction 
using real-time detection: Application to the detection of gene 
amplification in breast cancer," Int. J. Cancer 78:661-666 (1998). 

2. Declaration of Avi Ashkenazi, Ph.D. under 37 C.F.R. §1.132, with attached Exhibit A 
(Curriculum Vitae). 

3. Declaration of Paul Polakis, Ph.D. under 37 C.F.R. §1.132 (Polakis I). 

4. Orntoft, T.F., et al., "Genome-wide Study of Gene Copy Numbers, Transcripts, and 
Protein Levels in Pairs of Non-Invasive and Invasive Human Transitional Cell 
Carcinomas," Molecular & Cellular Proteomics 1:37-45 (2002). 

5. Hyman, E., et al., "Impact of DNA Amplification on Gene Expression Patterns in Breast 
Cancer," Cancer Research 62:6240-6245 (2002). 

6. Pollack, J.R., et al., "Microarray Analysis Reveals a Major Direct Role of DNA Copy 
Number Alteration in the Transcriptional Program of Human Breast Tumors," Proc. Natl. 
Acad. Sci. USA 99:12963-12968 (2002). 

7. Hanna, J.S., et al., "HER-2/neu Breast Cancer Predictive Testing," Pathology Associates 
Medical Laboratories (1999). 

8. Sen, S., "Aneuploidy and cancer," Curr. Opin. Oncol. 12:82-88 (2000). 

9. Pennica, D. et al., "WISP genes are members of the connective tissue growth factor family 
that are up-regulated in Wnt-1-transformed cells and aberrantly expressed in human colon 
tumors," Proc. Natl. Acad. Sci. USA 95:14717-14722 (1998). 

10. Gygi, S. P. et al., "Correlation between protein and mRNA abundance in yeast," Mol. Cell. 
Biol. 19:1720-1730 (1999). 

11. Hu, Y. et al., "Analysis of genomic and proteomic data using advanced literature mining," 
Journal of Proteome Research 2:405-412 (2003). 

12. Declaration of Paul Polakis, Ph.D. under 37 C.F.R. §1.132 (Polakis II). 

13. Alberts, B., et al., Molecular Biology of the Cell (3rd ed. 1994): Cell 3rd at 453; Figure 9-2 
of Cell 3rd; Cell 3rd at 403. 

14. Alberts, B., et al., Molecular Biology of the Cell (4th ed.): Figure 6-3 on page 302 and 
Figure 6-90 on page 364 of Cell 4th; Cell 4th at 364; Cell 4th at 379. 

15. Futcher, B., et al., Mol. Cell. Biol. 19(11):7357-68 (1999). 

16. Grenback, E., et al., Regul. Pept. 117(2):127-39 (2004). 

17. Lewin, B., Genes VI at 847-848 (1997). 

18. Meric, F., et al., Molecular Cancer Therapeutics 1:971-979 (2002). 

19. Papotti, M., et al., Virchows Arch. 440(5):461-75 (2002). 

20. Rudlowski, C., et al., Am. J. Clin. Pathol. 120(5):691-8 (2003). 

21. Van Der Wilt, C.L., et al., Eur. J. Cancer 39(5):691-7 (2003). 

22. Li et al, 2006, Oncogene, 25: 2628-2635. 

23. Nagaraja et al, 2006, Oncogene 25: 2328-2338. 

24. Waghray et al, 2001, Proteomics, 1: 1327-1338. 

25. Sagynaliev et al, 2005, Proteomics, 5: 3066-3076. 

Items 1-7 were submitted with Appellants' Response filed September 14, 2004, and acknowledged 
as having been considered by the Examiner in the Office Action mailed July 19, 2005. 

Items 8-10 were made of record by the Examiner in the Office Action mailed May 20, 2004. 

Item 11 was made of record by the Examiner in the Office Action mailed July 19, 2005. 



Items 12-21 were submitted with Appellants' Preliminary Amendment filed August 3, 2006, and 
were considered by the Examiner as indicated in the Final Office Action mailed November 1, 
2006. 

Items 22-25 were made of record by the Examiner in the Final Office Action mailed November 
1, 2006. 

10. RELATED PROCEEDINGS APPENDIX 



None. 

DECLARATION OF AUDREY D. GODDARD, Ph.D. UNDER 37 C.F.R. § 1.132 

Assistant Commissioner of Patents 
Washington, D.C. 20231 

Sir: 

I, Audrey D. Goddard, Ph.D., do hereby declare and say as follows: 

1. I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1993 and 2001, I headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et al., Biotechnology 10:413-417 (1992) (Exhibit B); Livak et al., PCR 
Methods Appl. 4:357-362 (1995) (Exhibit C); and Heid et al., Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 
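
For illustration only, and not as part of the declaration or its exhibits, the relative copy-number determination described in paragraph 5 can be sketched in Python. The sketch assumes the comparative threshold-cycle calculation commonly used with real-time PCR, perfect doubling of product per cycle, and normalization to a single-copy reference gene. None of these details, nor the function name and sample values, come from the declaration.

```python
def relative_copy_number(ct_target_tumor, ct_ref_tumor,
                         ct_target_normal, ct_ref_normal,
                         efficiency=2.0):
    """Estimate the fold change in target gene copy number, tumor versus normal.

    Each ct_* argument is a threshold cycle: the PCR cycle at which the
    fluorescence signal first rises above background. A lower threshold
    cycle means more starting template. All names here are hypothetical.
    """
    # Normalize the target gene to the reference gene within each sample.
    delta_tumor = ct_target_tumor - ct_ref_tumor
    delta_normal = ct_target_normal - ct_ref_normal
    # With ideal doubling (efficiency = 2), one cycle earlier means twice the template.
    return efficiency ** -(delta_tumor - delta_normal)


# Hypothetical readings: the target crosses threshold one cycle earlier in tumor.
fold = relative_copy_number(24.0, 25.0, 25.0, 25.0)
print(fold)  # 2.0
```

Under these assumptions, a target that reaches threshold one cycle earlier in the tumor sample, after normalization, corresponds to a 2-fold higher starting copy number.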

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et al., Proc. 
Natl. Acad. Sci. USA 95(25):14717-14722 (1998) (Exhibit E); Pitti et al., Nature 
396(6712):699-703 (1998) (Exhibit F); and Bieche et al., Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), of the first two of which I am a co-author. In particular, Pennica et al. have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et al. studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et al. used the assay to study gene amplification in breast cancer. 

7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 



Date 




Audrey D. Goddard, Ph.D. 



AUDREY D. GODDARD, Ph.D. 



Genentech, Inc. 
1 DNA Way 
South San Francisco, CA, 94080 
650.225.6429 
goddarda@gene.com 

110 Congo St. 
San Francisco, CA, 94131 
415.841.9154 
415.819.2247 (mobile) 
agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 

Genentech, Inc. 1993-present 
South San Francisco, CA 

2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translational research 

• Translational research using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 11 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 



1998 - 2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: 

• Management of a laboratory of up to nineteen, including postdoctoral fellow, associate 
scientist, senior research associate and research assistant/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery: ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 

1993 - 1998 Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 

Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89-12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocytic leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of β-naphthoflavone in C57BL/6J and DBA mice 



EDUCATION 



Ph.D. 1989 
University of Toronto, Toronto, Ontario, Canada. Department of Medical Biophysics. 
"Phenotypic and genotypic effects of mutations in the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 

Honours B.Sc. 1983 
McMaster University, Hamilton, Ontario, Canada. Department of Biochemistry. 
"The in vitro metabolism of the cytochrome P-448 inducer β-naphthoflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 

ACADEMIC AWARDS 

Imperial Cancer Research Fund Postdoctoral Fellowship 1989-1992 
Medical Research Council Studentship 1983-1988 
NSERC Undergraduate Summer Research Award 1983 
Society of Chemical Industry Merit Award (Hons. Biochem.) 1983 
Dr. Harry Lyman Hooker Scholarship 1981-1983 
J.L.W. Gill Scholarship 1981-1982 
Business and Professional Women's Club Scholarship 1980-1981 
Wyerhauser Foundation Scholarship 1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel genes. Functional Genomics: From Genome to Function, Litchfield 
Park, AZ, USA. October 2000 

High throughput identification, cloning and characterization of novel genes. G2K: Back to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 
2000 

Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting, San 
Francisco, CA, USA. October 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocytic leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 

PATENTS 

Goddard A, Godowski PJ, Gurney AL. NL2 Tie ligand homologue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL. NL3 Tie ligand homologue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P, Gurney A, Hillan KJ, Botstein D, Goddard A, Roy M, Ferrara N, Tumas D, 
Schwall R. NL4 Tie ligand homologue nucleic acid. Patent Number: 6,413,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL, Napier MA, Tumas D, Wood WI. Nucleic acid 
encoding A-33 related antigen polypeptides. Patent Number: 6,410,708. Date of Patent: 
Jun. 25, 2002. 

Botstein DA, Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood WI. WISP polypeptides and nucleic acids encoding same. 
Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351. Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski PJ and Gurney AL. Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 19, 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000. 

Attie K, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998. 

Attie K, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,113. Date of Patent: July 8, 1997. 

Multiple additional provisional applications filed 

PUBLICATIONS 

Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J, Frantz G, Stinson J, Corpuz RT, Simmons L, Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S., Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 135-142. 

Lee J, Ho WH, Maruoka M, Corpuz RT, Baldwin DT, Foster JS, Goddard AD, Yansura DG, 
Vandlen RL, Wood WI, Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the 
IL-17 receptor homolog IL-17Rh1. Journal of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood WI, Goddard AD and 
Gurney AL. (2000) Interleukin (IL)-22, a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S, Wada T, Lee J, Goddard A, French D, Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell Sci. 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735. 

Yan M, Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J, Heldens S, Schow P, Goddard AD, Wood WI, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB, Ng E, Kern JA, Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood WI, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA, 
Deuel B, Lew M, Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1-transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. Sci. USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood WI, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficient route to human bispecific IgG. Nature Biotechnology 16(7): 677-681. 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature, 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2L/TRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 
19: 15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A, Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood WI, Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-Induced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 

Goddard AD, Covello R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carlsson LMS and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 

We have enhanced the polymerase chain reaction (PCR) such that specific DNA 
sequences can be detected without opening the reaction tube. This enhancement 
requires the addition of ethidium bromide (EtBr) to a PCR. Since the fluorescence of 
EtBr increases in the presence of double-stranded (ds) DNA, an increase in fluorescence 
in such a PCR indicates a positive amplification, which can be easily monitored 
externally. In fact, amplification can be continuously monitored in order to 
follow its progress. The ability to simultaneously amplify specific DNA sequences 
and detect the product of the amplification both simplifies and improves PCR and 
may facilitate its automation and more widespread use in the clinic or in other 
situations requiring high sample throughput. 

Although the potential benefits of PCR [1] to clinical diagnostics are well known [2-5], it is 
still not widely used in this setting, even though it is four years since thermostable DNA 
polymerases [4] made PCR practical. Some of the reasons for its slow acceptance are high 
cost, lack of automation of pre- and post-PCR processing steps, and false positive results from 
carryover contamination. The first two points are related in that labor is the largest 
contributor to cost at the present stage of PCR development. Most current assays require 
some form of "downstream" processing once thermocycling is done in order to determine 
whether the target DNA sequence was present and has amplified. These include DNA 
hybridization, gel electrophoresis with or without use of restriction digestion [7], HPLC, or 
capillary electrophoresis [10]. These methods are labor-intense, have low throughput, and are 
difficult to automate. The third point is also closely related to downstream processing. 
The handling of the PCR product in these downstream processes increases the chances that 
amplified DNA will spread through the typing lab, resulting in a risk of "carryover" false 
positives in subsequent testing. 

These downstream processing steps would be eliminated if specific amplification and 
detection of amplified DNA took place simultaneously within an unopened reaction vessel. 
Assays in which such different processes take place without the need to separate reaction 
components have been termed "homogeneous". No truly homogeneous PCR assay has been 
demonstrated to date, although progress towards this end has been reported. Chehab, et al., 
developed a PCR product detection scheme using fluorescent primers that resulted in a 
fluorescent PCR product. Allele-specific primers, each with different fluorescent tags, were 
used to indicate the genotype of the DNA. However, the unincorporated primers must still be 
removed in a downstream process in order to visualize the result. Recently, Holland, et al., 
developed an assay in which the endogenous 5' exonuclease activity of Taq DNA polymerase 
was exploited to cleave a labeled oligonucleotide probe. The probe would only cleave if PCR 
amplification had produced its complementary sequence. In order to detect the cleavage 
products, however, a subsequent process is again needed. 

We have developed a truly homogeneous assay for PCR and PCR product detection based 
upon the greatly increased fluorescence that ethidium bromide and other DNA binding dyes 
exhibit when they are bound to ds-DNA. As outlined in Figure 1, a prototype PCR begins 
with primers that are single-stranded DNA (ss-DNA), dNTPs, and DNA polymerase. An 
amount of dsDNA containing the target sequence (target DNA) is also typically present. This 
amount can vary, depending on the application, from single-cell amounts of DNA to 
micrograms per PCR. If EtBr is present, the reagents that will fluoresce, in order of 
increasing fluorescence, are free EtBr itself, and EtBr bound to the single-stranded DNA 
primers and to the double-stranded target DNA (by its intercalation between the stacked 
bases of the DNA double-helix). After the first denaturation cycle, target DNA will be 
largely single-stranded. After a PCR is completed, the most significant change is the 
increase in the amount of dsDNA (the PCR product itself) of up to several micrograms. 
Formerly free EtBr is bound to the additional dsDNA, resulting in an increase in 
fluorescence. There is also some decrease in the amount of ssDNA primer, but because the 
binding of EtBr to ssDNA is much less than to dsDNA, the effect of this change on the total 
fluorescence of the sample is small. The fluorescence increase can be measured by directing 
excitation illumination through the walls of the amplification vessel before and after, or 
even continuously during, thermocycling. 

FIGURE 1 Principle of simultaneous amplification and detection of PCR product. The 
components of a PCR containing EtBr that are fluorescent are listed: EtBr itself, EtBr bound 
to either ssDNA or dsDNA. There is a large fluorescence enhancement when EtBr is bound to 
DNA, and binding is greatly enhanced when DNA is double-stranded. After sufficient (n) 
cycles of PCR, the net increase in dsDNA results in additional EtBr binding, and a net 
increase in total fluorescence. 

FIGURE 2 Gel electrophoresis of PCR amplification products of the human nuclear gene, 
HLA DQα, made in the presence of increasing amounts of EtBr (up to 8 µg/ml). The presence 
of EtBr has no obvious effect on the yield or specificity of amplification. 

FIGURE 3 (A) Fluorescence measurements from PCRs that contain 0.5 µg/ml EtBr and that 
are specific for Y-chromosome repeat sequences. Five replicate PCRs were begun containing 
each of the DNAs specified. At each indicated cycle, one of the five replicate PCRs for each 
DNA was removed from thermocycling and its fluorescence measured. Units of fluorescence 
are arbitrary. (B) UV photography of PCR tubes (0.5 ml Eppendorf-style, polypropylene 
microcentrifuge tubes) containing reactions, those starting from 2 ng male DNA and control 
reactions without any DNA, from (A). 

RESULTS 

PCR in the presence of EtBr. In order to assess the effect of
EtBr in PCR, amplifications of the human HLA DQα gene were
performed with the dye present at concentrations from 0.06 to
8.0 µg/ml (a typical concentration of EtBr used in staining of
nucleic acids following gel electrophoresis is 0.5 µg/ml). As
shown in Figure 2, gel electrophoresis revealed little or no
difference in the yield or quality of the amplification product
whether EtBr was absent or present at any of these concentrations,
indicating that EtBr does not inhibit PCR.

Detection of human Y-chromosome specific sequences.
Sequence-specific fluorescence enhancement of EtBr as a result of
PCR was demonstrated in a series of amplifications containing
0.5 µg/ml EtBr and primers specific to repeat DNA sequences found
on the human Y-chromosome 20. These PCRs initially contained
either 60 ng male, 60 ng female, 2 ng male human or no DNA. Five
replicate PCRs were begun for each DNA. After 0, 17, 21, 24 and
29 cycles of thermocycling, a PCR for each DNA was removed from
the thermocycler, and its fluorescence measured in a
spectrofluorometer and plotted vs. amplification cycle number
(Fig. 3A). The shape of this curve reflects the fact that by the
time an increase in fluorescence can be detected, the increase in
DNA is becoming linear and not exponential with cycle number. As
shown, the fluorescence increased about three-fold over the
background fluorescence for the PCRs containing human male DNA,
but did not significantly increase for negative control PCRs,
which contained either no DNA or human female DNA. The more male
DNA present to begin with (60 ng versus 2 ng), the fewer cycles
were needed to give a detectable increase in fluorescence. Gel
electrophoresis on the products of these amplifications showed
that DNA fragments of the expected size were made in the male DNA
containing reactions and that little DNA synthesis took place in
the control samples.
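
The relationship between starting template and the cycle at which
fluorescence becomes detectable can be illustrated with an
idealized model. The sketch below is not drawn from the
measurements above: it assumes the product is multiplied by a
fixed factor every cycle and that detection occurs at an arbitrary
threshold, and the function name and parameters are hypothetical.

```python
import math


def cycles_to_detect(start_amount, threshold_amount, efficiency=1.0):
    """Return the first whole cycle at which product reaches the threshold.

    Assumes idealized exponential growth: the amount is multiplied by
    (1 + efficiency) every cycle. Both amounts share arbitrary units.
    """
    if efficiency <= 0:
        raise ValueError("efficiency must be positive")
    if start_amount <= 0:
        return None  # no template, so the threshold is never reached
    if start_amount >= threshold_amount:
        return 0
    growth_per_cycle = 1.0 + efficiency
    cycles = math.log(threshold_amount / start_amount) / math.log(growth_per_cycle)
    return math.ceil(cycles)


# Hypothetical threshold of 1e6 units; 60, 2 and 0 units of starting template.
for start in (60, 2, 0):
    print(start, cycles_to_detect(start, 1e6))
```

Under these assumptions a larger starting amount always crosses the
threshold in fewer cycles, and a reaction without template never
does, which is the qualitative pattern described for Figure 3A.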

In addition, the increase in fluorescence was visualized by
simply laying the completed, unopened PCRs on a UV
transilluminator and photographing them through a red filter.
This is shown in Figure 3B for the reactions that began with 2 ng
male DNA and those with no DNA.

Detection of specific alleles of the human β-globin gene. In
order to demonstrate that this approach has adequate specificity
to allow genetic screening, a detection of the sickle-cell anemia
mutation was performed. Figure 4 shows the fluorescence from
completed amplifications containing EtBr (0.5 µg/ml) as detected
by photography of the reaction tubes on a UV transilluminator.
These reactions were performed using primers specific for either
the wild-type or sickle-cell mutation of the human β-globin gene.
The specificity for each allele is imparted by placing the
sickle-mutation site at the terminal 3' nucleotide of one primer.
By using an appropriate primer annealing temperature, primer
extension, and thus amplification, can take place only if the 3'
nucleotide of the primer is complementary to the β-globin allele
present.

Each pair of amplifications shown in Figure 4 consists of a
reaction with either the wild-type allele specific (left tube) or
sickle-allele specific (right tube) primers. Three different DNAs
were typed: DNA from a homozygous, wild-type β-globin individual
(AA); from a heterozygous sickle β-globin individual (AS); and
from a homozygous sickle β-globin individual (SS). Each DNA (50 ng
genomic DNA to start each PCR) was analyzed in triplicate (3 pairs
of reactions each). The DNA type was reflected in the relative
fluorescence intensities in each pair of completed amplifications.
There was a significant increase in fluorescence only where a
β-globin allele DNA matched the primer set. When measured on a
spectrofluorometer (data not shown), this fluorescence was about
three times that present in a PCR where both β-globin alleles were
mismatched to the primer set. Gel electrophoresis (not shown)
established that this increase in fluorescence was due to the
synthesis of nearly a microgram of a DNA fragment of the expected
size for β-globin. There was little synthesis of dsDNA in
reactions in which the allele-specific primer was mismatched to
both alleles.
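
The typing logic described above, in which each DNA is read from a
pair of tubes, can be sketched as a simple decision rule. This is
an illustrative assumption rather than the authors' procedure: the
function name, the use of a mismatched-control reading as
background, and the cut-off ratio of 2.0 (chosen to sit below the
roughly three-fold increase reported) are all hypothetical.

```python
def call_genotype(wild_tube, sickle_tube, background, min_ratio=2.0):
    """Call AA, AS or SS from the fluorescence of a pair of tubes.

    wild_tube   -- fluorescence with wild-type allele-specific primers
    sickle_tube -- fluorescence with sickle allele-specific primers
    background  -- fluorescence of a reaction mismatched to both alleles
    min_ratio   -- assumed fold increase over background counted as positive
    """
    wild_positive = wild_tube >= min_ratio * background
    sickle_positive = sickle_tube >= min_ratio * background
    if wild_positive and sickle_positive:
        return "AS"
    if wild_positive:
        return "AA"
    if sickle_positive:
        return "SS"
    return "no call"


# Arbitrary fluorescence units, with background set to 1.0.
print(call_genotype(3.0, 1.0, 1.0))
print(call_genotype(3.0, 3.0, 1.0))
print(call_genotype(1.0, 3.0, 1.0))
```

A pair in which neither tube rises above the cut-off is reported as
"no call" rather than forced into a genotype, since such a result
could reflect a failed amplification.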

Continuous monitoring of a PCR. Using a fiber optic device, it
is possible to direct excitation illumination from a
spectrofluorometer to a PCR undergoing thermocycling and to
return its fluorescence to the spectrofluorometer. The
fluorescence readout of such an arrangement, directed at an
EtBr-containing amplification of Y-chromosome specific sequences
from 25 ng of human male DNA, is shown in Figure 5. The readout
from a control PCR with no target DNA is also shown. Thirty cycles
of PCR were monitored for each.

The fluorescence trace as a function of time clearly shows the
effect of the thermocycling. Fluorescence intensity rises and
falls inversely with temperature. The fluorescence intensity is
minimum at the denaturation temperature (94°C) and maximum at the
annealing/extension temperature (50°C). In the negative-control
PCR, these fluorescence maxima and minima do not change
significantly over the thirty thermocycles, indicating that there
is little dsDNA synthesis without the appropriate target DNA, and
there is little if any bleaching of EtBr during the continuous
illumination of the sample.

In the PCR containing male DNA, the fluorescence maxima at the
annealing/extension temperature begin to increase at about 4000
seconds of thermocycling, and continue to increase with time,
indicating that dsDNA is being produced at a detectable level.
Note that the fluorescence minima at the denaturation temperature
do not significantly increase, presumably because at this
temperature there is no dsDNA for EtBr to bind. Thus the course of
the amplification is followed by tracking the fluorescence
increase at the annealing temperature. Analysis of the products of
these two amplifications by gel electrophoresis showed a DNA
fragment of the expected size for the male DNA containing sample
and no detectable DNA synthesis for the control sample.
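
Following an amplification by its fluorescence maxima, as
described above, amounts to reducing the continuous trace to one
value per thermocycle. The sketch below shows one way this could
be done; it is an assumption for illustration, not the analysis
used for Figure 5. The fixed cycle length, the number of baseline
cycles and the 1.2-fold rise criterion are hypothetical.

```python
def per_cycle_maxima(samples, cycle_seconds):
    """Reduce (time_s, fluorescence) samples to one maximum per cycle."""
    maxima = {}
    for time_s, fluorescence in samples:
        cycle = int(time_s // cycle_seconds)
        if cycle not in maxima or fluorescence > maxima[cycle]:
            maxima[cycle] = fluorescence
    return [maxima[cycle] for cycle in sorted(maxima)]


def first_rising_cycle(maxima, baseline_cycles=5, rise_factor=1.2):
    """Return the first cycle whose maximum exceeds the early baseline."""
    if len(maxima) <= baseline_cycles:
        return None
    baseline = sum(maxima[:baseline_cycles]) / baseline_cycles
    for cycle in range(baseline_cycles, len(maxima)):
        if maxima[cycle] > rise_factor * baseline:
            return cycle
    return None


# Synthetic trace: a low reading (denaturation) and a high reading
# (annealing/extension) per 100 s cycle; the maxima rise from cycle 7 on.
trace = []
for cycle in range(10):
    trace.append((cycle * 100 + 10, 1.0))
    trace.append((cycle * 100 + 60, 2.0 if cycle < 7 else 3.0))

print(first_rising_cycle(per_cycle_maxima(trace, 100)))
```

Because only the per-cycle maxima are compared, the unchanged
minima at the denaturation temperature do not affect the result.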

DISCUSSION 

Downstream processes such as hybridization to a sequence-specific
probe can enhance the specificity of DNA detection by PCR. The
elimination of these processes means that the specificity of this
homogeneous assay depends solely on that of PCR. In the case of
sickle-cell disease, we have shown that PCR alone has sufficient
DNA sequence specificity to permit genetic screening. Using
appropriate amplification conditions, there is little non-specific
production of dsDNA in the absence of the appropriate target
allele.

The specificity required to detect pathogens can be more or less
than that required to do genetic screening, depending on the
number of pathogens in the sample and the amount of other DNA that
must be taken with the sample. A difficult target is HIV, which
requires detection of a viral genome that can be at the level of a
few copies per thousands of host cells 6. Compared with genetic
screening, which is performed on cells containing at least one
copy of the target sequence, HIV detection requires both more
specificity and the input of more total DNA (up to microgram
amounts) in order to have sufficient numbers of target sequences.
This large amount of starting DNA in an amplification
significantly increases the background fluorescence over which any
additional fluorescence produced by PCR must be detected. An
additional complication that occurs with targets in low
copy-number is the formation of the "primer-dimer" artifact. This
is the result of the extension of one primer using the other
primer as a template. Although this occurs infrequently, once it
occurs the extension product is a substrate for PCR amplification,
and can compete with true PCR targets if those targets are rare.
The primer-dimer product is of course dsDNA and thus is a
potential source of false signal in this homogeneous assay.

FIGURE 4 UV photography of PCR tubes containing amplifications
using EtBr that are specific to wild-type (A) or sickle (S)
alleles of the human β-globin gene. The left of each pair of tubes
contains allele-specific primers to the wild-type alleles, the
right tube primers to the sickle allele. The photograph was taken
after 30 cycles of PCR, and the input DNAs and the alleles they
contain are indicated. Fifty ng of DNA was used to begin PCR.
Typing was done in triplicate (3 pairs of PCRs) for each input
DNA.

FIGURE 5 Continuous, real-time monitoring of a PCR. A fiber optic
was used to carry excitation light to a PCR in progress and also
emitted light back to a fluorometer (see Experimental Protocol).
Amplification using human male-DNA specific primers in a PCR
starting with 20 ng of human male DNA (top), or in a control PCR
without DNA (bottom), were monitored. Thirty cycles of PCR were
followed for each. The temperature cycled between 94°C
(denaturation) and 50°C (annealing and extension). Note in the
male DNA PCR, the cycle (time) dependent increase in fluorescence
at the annealing/extension temperature.

To increase PCR specificity and reduce the effect of primer-dimer
amplification, we are investigating a number of approaches,
including the use of nested-primer amplifications that take place
in a single tube 8, and the "hot-start", in which nonspecific
amplification is reduced by raising the temperature of the
reaction before DNA synthesis begins 25. Preliminary results using
these approaches suggest that primer-dimer is effectively reduced
and it is possible to detect the increase in EtBr fluorescence in
a PCR instigated by a single HIV genome in a background of 10*
cells. With larger numbers of cells, the background fluorescence
contributed by genomic DNA becomes problematic. To reduce this
background, it may be possible to use sequence-specific
DNA-binding dyes that can be made to preferentially bind PCR
product over genomic DNA by incorporating the dye-binding DNA
sequence into the PCR product through a 5' "add-on" to the
oligonucleotide primer.

We have shown that the detection of fluorescence generated by an
EtBr-containing PCR is straightforward, both once PCR is completed
and continuously during thermocycling. The ease with which
automation of specific DNA detection can be accomplished is the
most promising aspect of this assay. The fluorescence analysis of
completed PCRs is already possible with existing instrumentation
in 96-well format. In this format, the fluorescence in each PCR
can be quantitated before, after, and even at selected points
during thermocycling by moving the rack of PCRs to a 96-microwell
plate fluorescence reader 40.

The instrumentation necessary to continuously monitor multiple
PCRs simultaneously is also simple in principle. A direct
extension of the apparatus used here is to have multiple
fiberoptics transmit the excitation light and fluorescent
emissions to and from multiple PCRs. The ability to monitor
multiple PCRs continuously may allow quantitation of target DNA
copy number. Figure 3 shows that the larger the amount of starting
target DNA, the sooner during PCR a fluorescence increase is
detected. Preliminary experiments (Higuchi and Dollinger,
manuscript in preparation) with continuous monitoring have shown a
sensitivity to two-fold differences in initial target DNA
concentration.
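
The idea that the cycle of first detection can be turned into an
estimate of starting copy number can be sketched as follows. This
is an idealized illustration, not the method of the experiments
cited: it assumes every reaction amplifies by the same fixed factor
per cycle, and the function name and parameters are hypothetical.

```python
def relative_start(sample_cycle, reference_cycle, efficiency=1.0):
    """Estimate starting target in a sample relative to a reference.

    Each cycle of earlier detection is taken to correspond to a
    (1 + efficiency)-fold larger starting amount.
    """
    return (1.0 + efficiency) ** (reference_cycle - sample_cycle)


# A sample detected one cycle before the reference, assuming doubling.
print(relative_start(18, 19))
```

With perfect doubling, a two-fold difference in starting target
corresponds to a shift of one cycle, so resolving two-fold
differences requires locating the rise to within about a cycle.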

Conversely, if the number of target molecules is known, as it can
be in genetic screening, continuous monitoring may provide a means
of detecting false positive and false negative results. With a
known number of target molecules, a true positive would exhibit
detectable fluorescence by a predictable number of cycles of PCR.
Increases in fluorescence detected before or after that cycle
would indicate potential artifacts. False negative results due to,
for example, inhibition of DNA polymerase, may be detected by
including within each PCR an inefficiently amplifying marker. This
marker results in a fluorescence increase only after a large
number of cycles, many more than are necessary to detect a true
positive. If a sample fails to have a fluorescence increase after
this many cycles, inhibition may be suspected. Since, in this
assay, conclusions are drawn based on the presence or absence of
fluorescence signal alone, such controls may be important. In any
event, before any test based on this principle is ready for the
clinic, an assessment of its false positive/false negative rates
will need to be obtained using a large number of known samples.
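
The controls proposed above can be expressed as a small decision
rule. The sketch below is only one possible reading of that
proposal: the function name, the result labels and the tolerance
of two cycles are hypothetical, and real cut-offs would have to
come from the assessment on known samples that the text calls for.

```python
def interpret_run(rise_cycle, expected_cycle, marker_cycle, tolerance=2):
    """Classify a monitored PCR from the cycle at which fluorescence rose.

    rise_cycle     -- cycle of first detectable increase, or None if none
    expected_cycle -- cycle predicted for a true positive
    marker_cycle   -- cycle at which the inefficient marker alone appears
    tolerance      -- assumed allowable deviation, in cycles
    """
    if rise_cycle is None:
        # Not even the marker amplified, so the reaction may be inhibited.
        return "inhibition suspected"
    if abs(rise_cycle - expected_cycle) <= tolerance:
        return "true positive"
    if abs(rise_cycle - marker_cycle) <= tolerance:
        return "negative, marker amplified"
    return "possible artifact"


# Hypothetical assay: true positives expected at cycle 20, marker at cycle 35.
for observed in (21, 35, 10, None):
    print(observed, interpret_run(observed, 20, 35))
```

The marker check comes after the true-positive check so that a
sample rising near the expected cycle is never attributed to the
marker.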

In summary, the inclusion in PCR of dyes whose fluorescence is
enhanced upon binding dsDNA makes it possible to detect specific
DNA amplification from outside the PCR tube. In the future,
instruments based upon this principle may facilitate the more
widespread use of PCR in applications that demand the high
throughput of samples.



EXPERIMENTAL PROTOCOL 

Human HLA-DQα gene amplifications containing EtBr. PCRs were set
up in 100 µl volumes containing 10 mM Tris-HCl, pH 8.3; 50 mM
KCl; 4 mM MgCl2; t£ units of Taq DNA polymerase (Perkin-Elmer
Cetus, Norwalk, CT); 20 pmole each of human HLA-DQα gene specific
oligonucleotide primers GH26 and GH27 15 and approximately W
copies of DQα PCR product diluted from a previous reaction.
Ethidium bromide (EtBr; Sigma) was used at the concentrations
indicated in Figure 2. Thermocycling proceeded for 20 cycles in a
model 480 thermocycler (Perkin-Elmer Cetus, Norwalk, CT) using a
"step-cycle" program of 94°C for 1 min. denaturation and WT* for
30 sec. annealing and 72°C for 30 sec. extension.

Y-chromosome specific PCR. PCRs (100 µl total reaction volume)
containing 0.5 µg/ml EtBr were prepared as described for HLA-DQα,
except with different primers and target DNAs. These PCRs
contained 15 pmole each male DNA-specific primers Y1.1 and
Y1.2 20, and either 60 ng male, 60 ng female, 2 ng male, or no
human DNA. Thermocycling was 94°C for 1 min. and 60°C for 1 min.
using a "step-cycle" program. The number of cycles for a sample
were as indicated in Figure 3. Fluorescence measurement is
described below.

Allele-specific, human β-globin gene PCR. Amplifications of
100 µl volume using 0.5 µg/ml of EtBr were prepared as described
for HLA-DQα above except with different primers and target DNAs.
These PCRs contained either primer pair HGP2/Hβ14A (wild-type
globin specific primers) or HGP2/Hβ14S (sickle-globin specific
primers) at 10 pmole each primer per PCR. These primers were
developed by Wu et al. 21. Three different target DNAs were used
in separate amplifications: 50 ng each of human DNA that was
homozygous for the sickle trait (SS), DNA that was heterozygous
for the sickle trait (AS), or DNA that was homozygous for the
w.t. globin (AA). Thermocycling was for 30 cycles at 94°C for
1 min. and 55°C for 1 min. using a "step-cycle" program. An
annealing temperature of 55°C had been shown by Wu et al. 21 to
provide allele-specific amplification. Completed PCRs were
photographed through a red filter (Wratten 23A) after placing the
reaction tubes atop a model TM-36 transilluminator (UV-products,
San Gabriel, CA).

Fluorescence measurement. Fluorescence measurements were made on
PCRs containing EtBr in a Fluorolog-2 fluorometer (SPEX, Edison,
NJ). Excitation was at the 500 nm band with about 2 nm bandwidth
with a GG 435 nm cut-off filter (Melles Griot, Inc., Irvine, CA)
to exclude second-order light. Emitted light was detected at
570 nm with a bandwidth of about 7 nm. An OG 530 nm cut-off filter
was used to remove the excitation light.

Continuous fluorescence monitoring of PCR. Continuous monitoring
of a PCR in progress was accomplished using the spectrofluorometer
and settings described above as well as a fiberoptic accessory
(SPEX cat. no. 1950) to both send excitation light to, and receive
emitted light from, a PCR placed in a well of a model 480
thermocycler (Perkin-Elmer Cetus). The probe end of the fiberoptic
cable was attached with "5 minute-epoxy" to the open top of a PCR
tube (a 0.5 ml polypropylene centrifuge tube with its cap removed)
effectively sealing it. The exposed top of the PCR tube and the
end of the fiberoptic cable were shielded from room light and the
room lights were kept dimmed during each run. The monitored PCR
was an amplification of Y-chromosome-specific repeat sequences as
described above, except using an annealing/extension temperature
of 50°C. The reaction was covered with mineral oil (2 drops) to
prevent evaporation. Thermocycling and fluorescence measurement
were started simultaneously. A time-base scan with a 10 second
integration time was used and the emission signal was ratioed to
the excitation signal to control for changes in light-source
intensity. Data were collected using the dm3000f, version 15
(SPEX) data system.

We thank Bob Jones for help with the spectrofluorometric
measurements and Heatherbell Fong for editing this manuscript.

BIO/TECHNOLOGY VOL 10 APRIL 1992
SIMULTANEOUS AMPLIFICATION AND DETECTION OF
SPECIFIC DNA SEQUENCES

Russell Higuchi*, Gavin Dollinger 1, P. Sean Walsh and Robert Griffith

Roche Molecular Systems, Inc., 1400 53rd St., Emeryville, CA 94608. 1 Chiron Corporation, 1400 53rd St., Emeryville, CA
94608. *Corresponding author.



We have enhanced the polymerase chain reaction (PCR) such that
specific DNA sequences can be detected without opening the
reaction tube. This enhancement requires the addition of ethidium
bromide (EtBr) to a PCR. Since the fluorescence of EtBr increases
in the presence of double-stranded (ds) DNA, an increase in
fluorescence in such a PCR indicates a positive amplification,
which can be easily monitored externally. In fact, amplification
can be continuously monitored in order to follow its progress. The
ability to simultaneously amplify specific DNA sequences and
detect the product of the amplification both simplifies and
improves PCR and may facilitate its automation and more widespread
use in the clinic or in other situations requiring high sample
throughput.



Although the potential benefits of PCR 1 to clinical diagnostics
are well known 2,5, it is still not widely used in this setting,
even though it is four years since thermostable DNA polymerases 4
made PCR practical. Some of the reasons for its slow acceptance
are high cost, lack of automation of pre- and post-PCR processing
steps, and false positive results from carryover-contamination.
The first two points are related in that labor is the largest
contributor to cost at the present stage of PCR development. Most
current assays require some form of "downstream" processing once
thermocycling is done in order to determine whether the target DNA
sequence was present and has amplified. These include DNA
hybridization, gel electrophoresis with or without use of
restriction digestion, HPLC, or capillary electrophoresis 10.
These methods are labor-intense, have low throughput, and are
difficult to automate. The third point is also closely related to
downstream processing. The handling of the PCR product in these
downstream processes increases the chances that amplified DNA will
spread through the typing lab, resulting in a risk of "carryover"
false positives in subsequent testing.

These downstream processing steps would be eliminated if specific
amplification and detection of amplified DNA took place
simultaneously within an unopened reaction vessel. Assays in which
such different processes take place without the need to separate
reaction components have been termed "homogeneous". No truly
homogeneous PCR assay has been demonstrated to date, although
progress towards this end has been reported. Chehab, et al. 12,
developed a PCR product detection scheme using fluorescent primers
that resulted in a fluorescent PCR product. Allele-specific
primers, each with different fluorescent tags, were used to
indicate the genotype of the DNA. However, the unincorporated
primers must still be removed in a downstream process in order to
visualize the result. Recently, Holland, et al. 13, developed an
assay in which the endogenous 5' exonuclease activity of Taq DNA
polymerase was exploited to cleave a labeled oligonucleotide
probe. The probe would only cleave if PCR amplification had
produced its complementary sequence. In order to detect the
cleavage products, however, a subsequent process is again needed.

We have developed a truly homogeneous assay for PCR and PCR
product detection based upon the greatly increased fluorescence
that ethidium bromide and other DNA binding dyes exhibit when they
are bound to dsDNA 14-19. As outlined in Figure 1, a prototype PCR
begins with primers that are single-stranded DNA (ssDNA), dNTPs,
and DNA polymerase. An amount of dsDNA containing the target
sequence (target DNA) is also typically present. This amount can
vary, depending on the application, from single-cell amounts of
DNA 17 to micrograms per PCR 18. If EtBr is present, the reagents
that will fluoresce, in order of increasing fluorescence, are free
EtBr itself, and EtBr bound to the single-stranded DNA primers and
to the double-stranded target DNA (by its intercalation between
the stacked bases of the DNA double-helix). After the first
denaturation cycle, target DNA will be largely single-stranded.
After a PCR is completed, the most significant change is the
increase in the amount of dsDNA (the PCR product itself) of up to
several micrograms. Formerly free EtBr is bound to the additional
dsDNA, resulting in an increase in fluorescence. There is also
some decrease in the amount of ssDNA primer, but because the
binding of EtBr to ssDNA is much less than to dsDNA, the effect of
this change on the total fluorescence of the sample is small. The
fluorescence increase can be measured by directing excitation
illumination through the walls of the amplification vessel before
and after, or even continuously during, thermocycling.

FIGURE 1 Principle of simultaneous amplification and detection of
PCR product. The components of a PCR containing EtBr that are
fluorescent are listed: EtBr itself, EtBr bound to either ssDNA or
dsDNA. There is a large fluorescence enhancement when EtBr is
bound to DNA and binding is greatly enhanced when DNA is
double-stranded. After sufficient (n) cycles of PCR, the net
increase in dsDNA results in additional EtBr binding and a net
increase in total fluorescence.

FIGURE 2 Gel electrophoresis of PCR amplification products of the
human, nuclear gene, HLA DQα, made in the presence of increasing
amounts of EtBr (up to 8 µg/ml). The presence of EtBr has no
obvious effect on the yield or specificity of amplification.

FIGURE 3 (A) Fluorescence measurements from PCRs that contain
0.5 µg/ml EtBr and that are specific for Y-chromosome repeat
sequences. Five replicate PCRs were begun containing each of the
DNAs specified. At each indicated cycle, one of the five replicate
PCRs for each DNA was removed from thermocycling and its
fluorescence measured. Units of fluorescence are arbitrary. (B)
UV photography of PCR tubes (0.5 ml Eppendorf-style, polypropylene
microcentrifuge tubes) containing reactions, those starting
from 2 ng male DNA and control reactions without any DNA,
from (A).

RESULTS 

PCR in the presence of EtBr. In order to assess the effect of
EtBr in PCR, amplifications of the human HLA DQα gene were
performed with the dye present at concentrations from 0.06 to
8.0 µg/ml (a typical concentration of EtBr used in staining of
nucleic acids following gel electrophoresis is 0.5 µg/ml). As
shown in Figure 2, gel electrophoresis revealed little or no
difference in the yield or quality of the amplification product
whether EtBr was absent or present at any of these concentrations,
indicating that EtBr does not inhibit PCR.

Detection of human Y-chromosome specific sequences.
Sequence-specific fluorescence enhancement of EtBr as a result of
PCR was demonstrated in a series of amplifications containing
0.5 µg/ml EtBr and primers specific to repeat DNA sequences found
on the human Y-chromosome 20. These PCRs initially contained
either 60 ng male, 60 ng female, 2 ng male human or no DNA. Five
replicate PCRs were begun for each DNA. After 0, 17, 21, 24 and
29 cycles of thermocycling, a PCR for each DNA was removed from
the thermocycler, and its fluorescence measured in a
spectrofluorometer and plotted vs. amplification cycle number
(Fig. 3A). The shape of this curve reflects the fact that by the
time an increase in fluorescence can be detected, the increase in
DNA is becoming linear and not exponential with cycle number. As
shown, the fluorescence increased about three-fold over the
background fluorescence for the PCRs containing human male DNA,
but did not significantly increase for negative control PCRs,
which contained either no DNA or human female DNA. The more male
DNA present to begin with (60 ng versus 2 ng), the fewer cycles
were needed to give a detectable increase in fluorescence. Gel
electrophoresis on the products of these amplifications showed
that DNA fragments of the expected size were made in the male DNA
containing reactions and that little DNA synthesis took place in
the control samples.

In addition, the increase in fluorescence was visualized by
simply laying the completed, unopened PCRs on a UV
transilluminator and photographing them through a red filter.
This is shown in Figure 3B for the reactions that began with 2 ng
male DNA and those with no DNA.

Detection of specific alleles of the human β-globin gene. In
order to demonstrate that this approach has adequate specificity
to allow genetic screening, a detection of the sickle-cell anemia
mutation was performed. Figure 4 shows the fluorescence from
completed amplifications containing EtBr (0.5 µg/ml) as detected
by photography of the reaction tubes on a UV transilluminator.
These reactions were performed using primers specific for either
the wild-type or sickle-cell mutation of the human β-globin gene.
The specificity for each allele is imparted by placing the
sickle-mutation site at the terminal 3' nucleotide of one primer.
By using an appropriate primer annealing temperature, primer
extension, and thus amplification, can take place only if the 3'
nucleotide of the primer is complementary to the β-globin allele
present.

Each pair of amplifications shown in Figure 4 consists of a
reaction with either the wild-type allele specific (left tube) or
sickle-allele specific (right tube) primers. Three different DNAs
were typed: DNA from a homozygous, wild-type β-globin individual
(AA); from a heterozygous sickle β-globin individual (AS); and
from a homozygous sickle β-globin individual (SS). Each DNA (50 ng
genomic DNA to start each PCR) was analyzed in triplicate (3 pairs
of reactions each). The DNA type was reflected in the relative
fluorescence intensities in each pair of completed amplifications.
There was a significant increase in fluorescence only where a
β-globin allele DNA matched the primer set. When measured on a
spectrofluorometer (data not shown), this fluorescence was about
three times that present in a PCR where both β-globin alleles were
mismatched to the primer set. Gel electrophoresis (not shown)
established that this increase in fluorescence was due to the
synthesis of nearly a microgram of a DNA fragment of the expected
size for β-globin. There was little synthesis of dsDNA in
reactions in which the allele-specific primer was mismatched to
both alleles.

Continuous monitoring of a PCR. Using a fiber optic device, it
is possible to direct excitation illumination from a
spectrofluorometer to a PCR undergoing thermocycling and to
return its fluorescence to the spectrofluorometer. The
fluorescence readout of such an arrangement, directed at an
EtBr-containing amplification of Y-chromosome specific sequences
from 25 ng of human male DNA, is shown in Figure 5. The readout
from a control PCR with no target DNA is also shown. Thirty cycles
of PCR were monitored for each.

The fluorescence trace as a function of time clearly shows the
effect of the thermocycling. Fluorescence intensity rises and
falls inversely with temperature. The fluorescence intensity is
minimum at the denaturation temperature (94°C) and maximum at the
annealing/extension temperature (50°C). In the negative-control
PCR, these fluorescence maxima and minima do not change
significantly over the thirty thermocycles, indicating that there
is little dsDNA synthesis without the appropriate target DNA, and
there is little if any bleaching of EtBr during the continuous
illumination of the sample.

In the PCR containing male DNA, the fluorescence maxima at the
annealing/extension temperature begin to increase at about 4000
seconds of thermocycling, and continue to increase with time,
indicating that dsDNA is being produced at a detectable level.
Note that the fluorescence minima at the denaturation temperature
do not significantly increase, presumably because at this
temperature there is no dsDNA for EtBr to bind. Thus the course of
the amplification is followed by tracking the fluorescence
increase at the annealing temperature. Analysis of the products of
these two amplifications by gel electrophoresis showed a DNA
fragment of the expected size for the male DNA containing sample
and no detectable DNA synthesis for the control sample.

DISCUSSION 

Downstream processes such as hybridization to a sequence-specific
probe can enhance the specificity of DNA detection by PCR. The
elimination of these processes means that the specificity of this
homogeneous assay depends solely on that of PCR. In the case of
sickle-cell disease, we have shown that PCR alone has sufficient
DNA sequence specificity to permit genetic screening. Using
appropriate amplification conditions, there is little non-specific
production of dsDNA in the absence of the appropriate target
allele.

The specificity required to detect pathogens can be
more or less than that required to do genetic screening,
depending on the number of pathogens in the sample and
the amount of other DNA that must be taken with the
sample. A difficult target is HIV, which requires detection
of a viral genome that can be at the level of a few copies
per thousands of host cells (6). Compared with genetic
screening, which is performed on cells containing at least
one copy of the target sequence, HIV detection requires
both more specificity and the input of more total
DNA (up to microgram amounts) in order to have
sufficient numbers of target sequences. This large amount of
starting DNA in an amplification significantly increases
the background fluorescence over which any additional
fluorescence produced by PCR must be detected. An
additional complication that occurs with targets in low
copy-number is the formation of the "primer-dimer"
artifact. This is the result of the extension of one primer
using the other primer as a template. Although this occurs
infrequently, once it occurs the extension product is a
substrate for PCR amplification, and can compete with
true PCR targets if those targets are rare. The
primer-dimer product is of course dsDNA and thus is a potential
source of false signal in this homogeneous assay.

UV photography of PCR tubes containing amplifications
using EtBr that are specific to wild-type (A) or sickle (S) alleles of
the human β-globin gene. The left of each pair of tubes contains
allele-specific primers to the wild-type alleles, the right tube
primers to the sickle allele. The photograph was taken after 30
cycles of PCR, and the input DNAs and the alleles they contain
are indicated. Fifty ng of DNA was used to begin PCR. Typing
was done in triplicate (3 pairs of PCRs) for each input DNA.

FIGURE 5 Continuous, real-time monitoring of a PCR. A fiber optic
was used to carry excitation light to a PCR in progress and also
emitted light back to a fluorometer (see Experimental Protocol).
Amplification using human male-DNA specific primers in a PCR
starting with 20 ng of human male DNA (top), or in a control
PCR without DNA (bottom), were monitored. Thirty cycles of
PCR were followed for each. The temperature cycled between
94°C (denaturation) and 50°C (annealing and extension). Note in
the male DNA PCR, the cycle (time) dependent increase in
fluorescence at the annealing/extension temperature.

To increase PCR specificity and reduce the effect of
primer-dimer amplification, we are investigating a
number of approaches, including the use of nested-primer
amplifications that take place in a single tube (8), and the
"hot-start", in which nonspecific amplification is reduced
by raising the temperature of the reaction before DNA
synthesis begins (25). Preliminary results using these
approaches suggest that "primer-dimer" is effectively reduced
and it is possible to detect the increase in EtBr
fluorescence in a PCR instigated by a single HIV genome in a
background of 10^5 cells. With larger numbers of cells, the
background fluorescence contributed by genomic DNA
becomes problematic. To reduce this background, it may
be possible to use sequence-specific DNA-binding dyes
that can be made to preferentially bind PCR product over
genomic DNA by incorporating the dye-binding DNA
sequence into the PCR product through a 5' "add-on" to
the oligonucleotide primer.

We have shown that the detection of fluorescence
generated by an EtBr-containing PCR is straightforward,
both once PCR is completed and continuously during
thermocycling. The ease with which automation of
specific DNA detection can be accomplished is the most
promising aspect of this assay. The fluorescence analysis
of completed PCRs is already possible with existing
instrumentation in 96-well format. In this format, the
fluorescence in each PCR can be quantitated before, after, and
even at selected points during thermocycling by moving
the rack of PCRs to a 96-microwell plate fluorescence
reader.

The instrumentation necessary to continuously monitor
multiple PCRs simultaneously is also simple in principle.
A direct extension of the apparatus used here is to have
multiple fiberoptics transmit the excitation light and
fluorescent emissions to and from multiple PCRs. The ability
to monitor multiple PCRs continuously may allow
quantitation of target DNA copy number. Figure 3 shows that
the larger the amount of starting target DNA, the sooner
during PCR a fluorescence increase is detected.
Preliminary experiments (Higuchi and Dollinger, manuscript in
preparation) with continuous monitoring have shown a
sensitivity to two-fold differences in initial target DNA
concentration.
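
The link between starting copy number and the cycle at which
fluorescence first rises can be illustrated with a small
calculation. The sketch below is not taken from the article: it
assumes idealized amplification in which the product multiplies
by (1 + efficiency) every cycle, and the detection threshold of
10^9 copies is a hypothetical value chosen only for the example.

```python
def cycles_to_threshold(start_copies, threshold_copies, efficiency=1.0):
    """Count cycles until start_copies * (1 + efficiency)**n reaches the
    detection threshold. efficiency=1.0 means perfect doubling (assumed)."""
    if start_copies <= 0:
        raise ValueError("start_copies must be positive")
    copies = float(start_copies)
    cycles = 0
    while copies < threshold_copies:
        copies *= 1.0 + efficiency
        cycles += 1
    return cycles


THRESHOLD = 1e9  # hypothetical number of product copies needed for detection

for start in (1000, 2000, 4000):
    print(start, "starting copies ->", cycles_to_threshold(start, THRESHOLD), "cycles")
```

Under these assumptions each two-fold increase in starting
template moves the detection point one cycle earlier, which is
the kind of difference continuous monitoring would need to
resolve.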

Conversely, if the number of target molecules is
known (as it can be in genetic screening), continuous
monitoring may provide a means of detecting false
positive and false negative results. With a known number of
target molecules, a true positive would exhibit detectable
fluorescence by a predictable number of cycles of PCR.
Increases in fluorescence detected before or after that
cycle would indicate potential artifacts. False negative
results due to, for example, inhibition of DNA
polymerase, may be detected by including within each PCR an
inefficiently amplifying marker. This marker results in a
fluorescence increase only after a large number of
cycles, many more than are necessary to detect a true
positive. If a sample fails to have a fluorescence increase
after this many cycles, inhibition may be suspected. Since,
in this assay, conclusions are drawn based on the presence
or absence of fluorescence signal alone, such controls may
be important. In any event, before any test based on this
principle is ready for the clinic, an assessment of its false
positive/false negative rates will need to be obtained using
a large number of known samples.
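
The decision logic outlined in this paragraph can be written
down as a short sketch. Everything specific in it is an
assumption for illustration: the function name, the category
labels, and the tolerance of two cycles are hypothetical and do
not come from the article.

```python
def classify_run(signal_cycle, expected_cycle, marker_cycle, tolerance=2):
    """Interpret the cycle at which fluorescence first rises.

    signal_cycle   -- cycle of first detectable rise, or None if none seen
    expected_cycle -- cycle predicted for a true positive (known target number)
    marker_cycle   -- late cycle at which the inefficient marker should appear
    tolerance      -- allowed deviation in cycles (assumed value)
    """
    if signal_cycle is None:
        # Not even the marker amplified: the polymerase may be inhibited.
        return "inhibition suspected"
    if abs(signal_cycle - expected_cycle) <= tolerance:
        return "consistent with true positive"
    if signal_cycle >= marker_cycle - tolerance:
        # Only the late-appearing marker produced signal.
        return "marker only (negative)"
    return "potential artifact"


for observed in (None, 25, 15, 40):
    print(observed, "->", classify_run(observed, expected_cycle=25, marker_cycle=40))
```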

In summary, the inclusion in PCR of dyes whose
fluorescence is enhanced upon binding dsDNA makes it
possible to detect specific DNA amplification from outside
the PCR tube. In the future, instruments based upon this
principle may facilitate the more widespread use of PCR
in applications that demand the high throughput of
samples.

EXPERIMENTAL PROTOCOL 

Human HLA-DQα gene amplifications containing EtBr.
PCRs were set up in 100 µl volumes containing 10 mM Tris-HCl,
pH 8.3; 50 mM KCl; 4 mM MgCl2; 2.5 units of Taq DNA
polymerase (Perkin-Elmer Cetus, Norwalk, CT); 20 pmole each
of human HLA-DQα gene specific oligonucleotide primers
GH26 and GH27 (19) and approximately W copies of DQα PCR
product diluted from a previous reaction. Ethidium bromide
(EtBr; Sigma) was used at the concentrations indicated in Figure
2. Thermocycling proceeded for 20 cycles in a model 480
thermocycler (Perkin-Elmer Cetus, Norwalk, CT) using a
"step-cycle" program of 94°C for 1 min. denaturation and 60°C for 30
sec. annealing and 72°C for 30 sec. extension.

Y-chromosome specific PCR. PCRs (100 µl total reaction
volume) containing 0.5 µg/ml EtBr were prepared as described
for HLA-DQα, except with different primers and target DNAs.
These PCRs contained 15 pmole each male DNA-specific primers
Y1.1 and Y1.2 (20), and either 60 ng male, 60 ng female, 2 ng male,
or no human DNA. Thermocycling was 94°C for 1 min. and 60°C
for 1 min. using a "step-cycle" program. The number of cycles for
a sample were as indicated in Figure 3. Fluorescence
measurement is described below.

Allele-specific, human β-globin gene PCR. Amplifications of
100 µl volume using 0.5 µg/ml of EtBr were prepared as
described for HLA-DQα above except with different primers and
target DNAs. These PCRs contained either primer pair HGP2/
Hβ14A (wild-type globin specific primers) or HGP2/Hβ14S
(sickle-globin specific primers) at 10 pmole each primer per PCR.
These primers were developed by Wu et al. (21). Three different
target DNAs were used in separate amplifications: 50 ng each of
human DNA that was homozygous for the sickle trait (SS), DNA
that was heterozygous for the sickle trait (AS), or DNA that was
homozygous for the w.t. globin (AA). Thermocycling was for 30
cycles at 94°C for 1 min. and 55°C for 1 min. using a "step-cycle"
program. An annealing temperature of 55°C had been shown by
Wu et al. (21) to provide allele-specific amplification. Completed
PCRs were photographed through a red filter (Wratten 23A)
after placing the reaction tubes atop a model TM-36
transilluminator (UV-products, San Gabriel, CA).

Fluorescence measurement. Fluorescence measurements were
made on PCRs containing EtBr in a Fluorolog-2 fluorometer
(SPEX, Edison, NJ). Excitation was at the 500 nm band with
about 2 nm bandwidth with a GO nm cut-off filter (Melles
Griot, Inc., Irvine, CA) to exclude second-order light. Emitted
light was detected at 570 nm with a bandwidth of about 7 nm. An
OG 530 nm cut-off filter was used to remove the excitation light.

Continuous fluorescence monitoring of PCR. Continuous
monitoring of a PCR in progress was accomplished using the
spectrofluorometer and settings described above as well as a
fiberoptic accessory (SPEX cat. no. 1950) to both send excitation
light to, and receive emitted light from, a PCR placed in a well of
a model 480 thermocycler (Perkin-Elmer Cetus). The probe end
of the fiberoptic cable was attached with "5 minute-epoxy" to the
open top of a PCR tube (a 0.5 ml polypropylene centrifuge tube
with its cap removed), effectively sealing it. The exposed top of
the PCR tube and the end of the fiberoptic cable were shielded
from room light and the room lights were kept dimmed during
each run. The monitored PCR was an amplification of
Y-chromosome-specific repeat sequences as described above, except
using an annealing/extension temperature of 50°C. The reaction
was covered with mineral oil (2 drops) to prevent evaporation.
Thermocycling and fluorescence measurement were started
simultaneously. A time-base scan with a 10 second integration time
was used and the emission signal was ratioed to the excitation
signal to control for changes in light-source intensity. Data were
collected using the dm3000f, version 1.5 (SPEX) data system.

We thank Bob Jones for help with the spectrofluorometric
measurements and Heatherbell Fong for editing this manuscript.



Oligonucleotides with Fluorescent Dyes at
Opposite Ends Provide a Quenched Probe
System Useful for Detecting PCR Product
and Nucleic Acid Hybridization

Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Giusti, and Karin Deetz 

Perkin-Elmer, Applied Biosystems Division, Foster City, California 94404



The 5' nuclease PCR assay detects the
accumulation of specific PCR product
by hybridization and cleavage of a
double-labeled fluorogenic probe
during the amplification reaction.
The probe is an oligonucleotide with
both a reporter fluorescent dye and a
quencher dye attached. An increase
in reporter fluorescence intensity
indicates that the probe has hybridized
to the target PCR product and has
been cleaved by the 5' → 3'
nucleolytic activity of Taq DNA polymerase.
In this study, probes with the
quencher dye attached to an internal
nucleotide were compared with
probes with the quencher dye
attached to the 3'-end nucleotide. In all
cases, the reporter dye was attached
to the 5' end. All intact probes
showed quenching of the reporter
fluorescence. In general, probes with
the quencher dye attached to the
3'-end nucleotide exhibited a larger
signal in the 5' nuclease PCR assay than
the internally labeled probes. It is
proposed that the larger signal is
caused by increased likelihood of
cleavage by Taq DNA polymerase
when the probe is hybridized to a
template strand during PCR. Probes
with the quencher dye attached to
the 3'-end nucleotide also exhibited
an increase in reporter fluorescence
intensity when hybridized to a
complementary strand. Thus,
oligonucleotides with reporter and quencher
dyes attached at opposite ends can
be used as homogeneous
hybridization probes.



A homogeneous assay for detecting
the accumulation of specific PCR
product that uses a double-labeled
fluorogenic probe was described by Lee et al.(1)
The assay exploits the 5' → 3'
nucleolytic activity of Taq DNA
polymerase(2,3) and is diagramed in Figure 1.
The fluorogenic probe consists of an
oligonucleotide with a reporter fluorescent
dye, such as a fluorescein, attached to
the 5' end; and a quencher dye, such as a
rhodamine, attached internally. When
the fluorescein is excited by irradiation,
its fluorescent emission will be
quenched if the rhodamine is close
enough to be excited through the
process of fluorescence energy transfer
(FET).(4,5) During PCR, if the probe is
hybridized to a template strand, Taq DNA
polymerase will cleave the probe
because of its inherent 5' → 3' nucleolytic
activity. If the cleavage occurs between
the fluorescein and rhodamine dyes, it
causes an increase in fluorescein
fluorescence intensity because the fluorescein
is no longer quenched. The increase in
fluorescein fluorescence intensity
indicates that the probe-specific PCR product
has been generated. Thus, FET between a
reporter dye and a quencher dye is
critical to the performance of the probe in
the 5' nuclease PCR assay.

Quenching is completely dependent 
on the physical proximity of the two 
dyes. (6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 
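
The proximity dependence discussed above is commonly
summarized by the Förster relation for energy-transfer
efficiency. The article itself does not state this formula; it
is given here only as the standard textbook form, where E is
the transfer efficiency, r is the reporter-quencher
separation, and R_0 is the dye-pair-specific distance at which
transfer is 50% efficient:

```latex
E = \frac{1}{1 + \left( r / R_0 \right)^{6}}
```

Because of the sixth-power term, E falls steeply once r
exceeds R_0, which is consistent with the earlier assumption
that the quencher had to sit near the 5' end.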



MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence
of the oligonucleotides used in this
study. Linker arm nucleotide (LAN)
phosphoramidite was obtained from
Glen Research. The standard DNA
phosphoramidites, 6-carboxyfluorescein
(6-FAM) phosphoramidite,
6-carboxytetramethylrhodamine succinimidyl ester
(TAMRA NHS ester), and Phosphalink
for attaching a 3'-blocking phosphate,
were obtained from Perkin-Elmer,
Applied Biosystems Division.
Oligonucleotide synthesis was performed using an
ABI model 394 DNA synthesizer (Applied
Biosystems). Primer and complement
oligonucleotides were purified using
Oligo Purification Cartridges (Applied
Biosystems). Double-labeled probes were
synthesized with 6-FAM-labeled
phosphoramidite at the 5' end, LAN replacing
one of the T's in the sequence, and
Phosphalink at the 3' end. Following
deprotection and ethanol precipitation,
TAMRA NHS ester was coupled to the
LAN-containing oligonucleotide in 250
mM Na-bicarbonate buffer (pH 9.0) at
room temperature. Unreacted dye was
removed by passage over a PD-10
Sephadex column. Finally, the double-labeled
probe was purified by preparative
high-performance liquid chromatography
(HPLC) using an Aquapore C8 220x4.6-
mm column with particle size. The
column was developed with a 24-min
linear gradient of 8-20% acetonitrile in
0.1 M TEAA (triethylamine acetate).
Probes are named by designating the
sequence from Table 1 and the position of
the LAN-TAMRA moiety. For example,
probe A1-7 has sequence A1 with
LAN-TAMRA at nucleotide position 7 from the
5' end.

FIGURE 1 Diagram of 5' nuclease assay. Stepwise representation of the 5' → 3'
nucleolytic activity of Taq DNA polymerase acting on a fluorogenic probe during one
extension phase of PCR.



PCR Systems 

All PCR amplifications were performed
in the Perkin-Elmer GeneAmp PCR
System 9600 using 50-µl reactions that
contained 10 mM Tris-HCl (pH 8.3), 50 mM
KCl, 200 µM dATP, 200 µM dCTP, 200 µM
dGTP, 400 µM dUTP, 0.5 unit of
AmpErase uracil N-glycosylase (Perkin-Elmer),
and 1.25 unit of AmpliTaq DNA
polymerase (Perkin-Elmer). A 295-bp
segment from exon 3 of the human β-actin
gene (nucleotides 2141-2435 in the
sequence of Nakajima-Iijima et al.)(7) was
amplified using primers AFP and ARP
(Table 1), which are modified slightly
from those of du Breuil et al.(8) Actin
amplification reactions contained 4 mM
MgCl2, 20 ng of human genomic DNA,
50 nM A1 or A3 probe, and 300 nM each
primer. The thermal regimen was 50°C
(2 min), 95°C (10 min), 40 cycles of 95°C
(20 sec), 60°C (1 min), and hold at 72°C.
A 515-bp segment was amplified from a
plasmid that consists of a segment of λ
DNA (nucleotides 32,220-32,747)
inserted in the SmaI site of vector pUC119.
These reactions contained 3.5 mM
MgCl2, 1 ng of plasmid DNA, 50 nM P2 or
P5 probe, 200 nM primer F119, and 200
nM primer R119. The thermal regimen
was 50°C (2 min), 95°C (10 min), 25
cycles of 95°C (20 sec), 57°C (1 min), and
hold at 72°C.



Fluorescence Detection 

For each amplification reaction, a 40-µl
aliquot of a sample was transferred to an
individual well of a white, 96-well
microtiter plate (Perkin-Elmer). Fluorescence
was measured on the Perkin-Elmer
TaqMan LS-50B System, which consists of a
luminescence spectrometer with plate
reader assembly, a 485-nm excitation
filter, and a 515-nm emission filter.
Excitation was at 488 nm using a 5-nm slit
width. Emission was measured at 518
nm for 6-FAM (the reporter or R value)
and 582 nm for TAMRA (the quencher or
Q value) using a 10-nm slit width. To
determine the increase in reporter
emission that is caused by cleavage of the
probe during PCR, three normalizations
are applied to the raw emission data.
First, emission intensity of a buffer blank
is subtracted for each wavelength.
Second, emission intensity of the reporter is


TABLE 1 Sequences of Oligonucleotides

Name   Type         Sequence
F119   primer       ACCCACAGGAACTGATCACCACTC
R119   primer       ATGTCGCGTTCCGGCTGACGTTCTGC
P2     probe        TCGCATTACTGATCGTTGCCAACCAGTp
P2C    complement   GTACTGGTTGGCAACGATCAGTAATGCGATG
P5     probe        CGGATTTGCTGGTATCTATGACAAGGATp
P5C    complement   TTCATCCTTGTCATAGATACCAGCAAATCCG
AFP    primer       TCACCCACACTGTGCCCATCTACGA
ARP    primer       CAGCGGAACCGCTCATTGCCAATGG
A1     probe        ATGCCCTCCCCCATGCCATCCTGCGTp
A1C    complement   AGACGCAGGATGGCATGGGGGAGGGCATAC
A3     probe        CGCCCTGGACTTCGAGCAAGAGATp
A3C    complement   CCATCTCTTGCTCGAAGTCCAGGGCGAC

For each oligonucleotide used in this study, the nucleic acid sequence is given, written in the
5' → 3' direction. There are three types of oligonucleotides: PCR primer, fluorogenic probe used
in the 5' nuclease assay, and complement used to hybridize to the corresponding probe. For the
probes, the underlined base indicates a position where LAN with TAMRA attached was
substituted for a T. (p) The presence of a 3' phosphate on each probe.

Probe   Sequence
A1-2    RaQgccctcccccatgccatcctgcgtp
A1-7    RatgcccQcccccatgccatcctgcgtp
A1-14   RatgccctcccccaQgccatcctgcgtp
A1-19   RatgccctcccccatgccaQcctgcgtp
A1-22   RatgccctcccccatgccatccQgcgtp
A1-26   RatgccctcccccatgccatcctgcgQp

               518 nm                      582 nm
Probe   no temp.      +temp.        no temp.     +temp.       RQ−          RQ+          ΔRQ
A1-2     25.5 ± 2.1    32.7 ± 1.9    38.2 ± 3.0   38.2 ± 2.0  0.67 ± 0.01  0.86 ± 0.06  0.19 ± 0.06
A1-7     53.5 ± 4.3   395.1 ± 21.4  108.5 ± 6.3  110.3 ± 5.3  0.49 ± 0.03  3.58 ± 0.17  3.09 ± 0.18
A1-14   127.0 ± 4.9   403.5 ± 19.1  108.7 ± 5.3   93.1 ± 6.3  1.16 ± 0.02  4.34 ± 0.15  3.18 ± 0.15
A1-19   187.5 ± 17.9  422.7 ± 7.7    70.3 ± 7.4   73.0 ± 2.8  2.67 ± 0.05  5.80 ± 0.15  3.13 ± 0.16
A1-22   224.6 ± 9.4   482.2 ± 43.6  100.0 ± 4.0   96.2 ± 9.6  2.25 ± 0.03  5.02 ± 0.11  2.77 ± 0.12
A1-26   160.2 ± 8.9   454.1 ± 18.4   93.1 ± 5.4   90.7 ± 3.2  1.72 ± 0.02  5.01 ± 0.08  3.29 ± 0.08

FIGURE 2 Results of 5' nuclease assay comparing β-actin probes with TAMRA at different
nucleotide positions. As described in Materials and Methods, PCR amplifications containing the
indicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm.
Reported values are the average ±1 S.D. for six reactions run without added template (no temp.)
and six reactions run with template (+temp.). The RQ ratio was calculated for each individual
reaction and averaged to give the reported RQ− and RQ+ values.



divided by the emission intensity of the
quencher to give an RQ ratio for each
reaction tube. This normalizes for
well-to-well variations in probe
concentration and fluorescence measurement.
Finally, ΔRQ is calculated by subtracting
the RQ value of the no-template control
(RQ−) from the RQ value for the
complete reaction including template
(RQ+).
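
The three normalizations can be illustrated with a short Python
sketch. The function names are hypothetical, and the input
numbers are the mean 518-nm and 582-nm readings reported for
probe A1-7 in Figure 2, treated here as already
blank-subtracted (an assumption). Because the article averages
RQ over individual reactions, ratios of the mean readings only
approximate the published values.

```python
def rq_ratio(reporter, quencher, blank_reporter=0.0, blank_quencher=0.0):
    """Blank-subtract each wavelength, then divide reporter by quencher."""
    return (reporter - blank_reporter) / (quencher - blank_quencher)


def delta_rq(rq_plus, rq_minus):
    """Increase in RQ attributable to amplification of the probe target."""
    return rq_plus - rq_minus


# Mean emissions for probe A1-7 (Figure 2): (518 nm, 582 nm).
no_template = (53.5, 108.5)
with_template = (395.1, 110.3)

rq_minus = rq_ratio(*no_template)
rq_plus = rq_ratio(*with_template)
d_rq = delta_rq(rq_plus, rq_minus)

print(f"RQ- = {rq_minus:.2f}, RQ+ = {rq_plus:.2f}, dRQ = {d_rq:.2f}")
```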

RESULTS 

A series of probes with increasing
distances between the fluorescein reporter
and rhodamine quencher were tested to
investigate the minimum and maximum
spacing that would give an acceptable
performance in the 5' nuclease PCR
assay. These probes hybridize to a target
sequence in the human β-actin gene.
Figure 2 shows the results of an
experiment in which these probes were
included in PCR that amplified a segment
of the β-actin gene containing the target
sequence. Performance in the 5'
nuclease PCR assay is monitored by the
magnitude of ΔRQ, which is a measure
of the increase in reporter fluorescence
caused by PCR amplification of the
probe target. Probe A1-2 has a ΔRQ value
that is close to zero, indicating that the
probe was not cleaved appreciably
during the amplification reaction. This
suggests that with the quencher dye on the
second nucleotide from the 5' end, there
is insufficient room for Taq polymerase
to cleave efficiently between the reporter
and quencher. The other five probes
exhibited comparable ΔRQ values that are
clearly different from zero. Thus, all five
probes are being cleaved during PCR
amplification resulting in a similar increase
in reporter fluorescence. It should be
noted that complete digestion of a probe
produces a much larger increase in
reporter fluorescence than that observed
in Figure 2 (data not shown). Thus, even
in reactions where amplification occurs,
the majority of probe molecules remain
uncleaved. It is mainly for this reason
that the fluorescence intensity of the
quencher dye TAMRA changes little with
amplification of the target. This is what
allows us to use the 582-nm fluorescence
reading as a normalization factor.

The magnitude of RQ− depends
mainly on the quenching efficiency
inherent in the specific structure of the
probe and the purity of the
oligonucleotide. Thus, the larger RQ− values
indicate that probes A1-14, A1-19, A1-22, and
A1-26 probably have reduced quenching
as compared with A1-7. Still, the degree
of quenching is sufficient to detect a
highly significant increase in reporter
fluorescence when each of these probes
is cleaved during PCR.

To further investigate the ability of
TAMRA on the 3' end to quench 6-FAM
on the 5' end, three additional pairs of
probes were tested in the 5' nuclease
PCR assay. For each pair, one probe has
TAMRA attached to an internal
nucleotide and the other has TAMRA attached
to the 3' end nucleotide. The results are
shown in Table 2. For all three sets, the
probe with the 3' quencher exhibits a
ΔRQ value that is considerably higher
than for the probe with the internal
quencher. The RQ− values suggest that
differences in quenching are not as great
as those observed with some of the A1
probes. These results demonstrate that a
quencher dye on the 3' end of an
oligonucleotide can quench efficiently the


TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terminal Nucleotide

               518 nm                      582 nm
Probe   no temp.      +temp.        no temp.     +temp.        RQ−          RQ+          ΔRQ
A3-6     54.6 ± 3.2    84.8 ± 3.7   116.2 ± 6.4  115.6 ± 2.5   0.47 ± 0.02  0.73 ± 0.03  0.26 ± 0.04
A3-24    72.1 ± 2.9   236.5 ± 11.1   84.2 ± 4.0   90.2 ± 3.8   0.86 ± 0.02  2.62 ± 0.05  1.76 ± 0.05
P2-7     82.8 ± 4.4   384.0 ± 34.1  105.1 ± 6.4  120.4 ± 10.2  0.79 ± 0.02  3.19 ± 0.16  2.40 ± 0.16
P2-27   113.4 ± 6.6   555.4 ± 14.1  140.7 ± 8.5  118.7 ± 4.8   0.81 ± 0.01  4.68 ± 0.10  3.88 ± 0.10
P5-10    77.5 ± 6.5   244.4 ± 15.9   86.7 ± 4.3   95.8 ± 6.7   0.89 ± 0.05  2.55 ± 0.06  1.66 ± 0.08
P5-28    64.0 ± 5.2   333.6 ± 12.1  100.6 ± 6.1   94.7 ± 6.3   0.63 ± 0.02  3.53 ± 0.12  2.89 ± 0.13

Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2.

fluorescence of a reporter dye on the 5'
end. The degree of quenching is
sufficient for this type of oligonucleotide to
be used as a probe in the 5' nuclease PCR
assay.

To test the hypothesis that quenching
by a 3' TAMRA depends on the flexibility
of the oligonucleotide, fluorescence was
measured for probes in the
single-stranded and double-stranded states.
Table 3 reports the fluorescence observed
at 518 and 582 nm. The relative degree
of quenching is assessed by calculating
the RQ ratio. For probes with TAMRA
6-10 nucleotides from the 5' end, there
is little difference in the RQ values when
comparing single-stranded with
double-stranded oligonucleotides. The results
for probes with TAMRA at the 3' end are
much different. For these probes,
hybridization to a complementary strand
causes a dramatic increase in RQ. We
propose that this loss of quenching is
caused by the rigid structure of
double-stranded DNA, which prevents the 5'
and 3' ends from being in proximity.
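
The contrast between internally and 3'-labeled probes can be
made concrete by computing the fold change in RQ on
hybridization. The sketch below uses the single-stranded and
double-stranded RQ values reported in Table 3 for two probe
pairs; the function name and the dictionary layout are
assumptions made for the example.

```python
# (RQ single-stranded, RQ double-stranded) as reported in Table 3.
table3_rq = {
    "A1-7": (0.45, 0.50),   # TAMRA near the 5' end
    "A1-26": (0.81, 5.43),  # TAMRA at the 3' end
    "P2-7": (0.64, 0.58),   # TAMRA near the 5' end
    "P2-27": (0.61, 5.25),  # TAMRA at the 3' end
}


def rq_fold_change(rq_single, rq_double):
    """Ratio of double-stranded RQ to single-stranded RQ."""
    return rq_double / rq_single


fold = {name: rq_fold_change(ss, ds) for name, (ss, ds) in table3_rq.items()}
for name, value in fold.items():
    print(f"{name}: RQ changes {value:.1f}-fold on hybridization")
```

For the internally labeled probes the ratio stays close to 1,
whereas for the 3'-labeled probes it rises several-fold, in
line with the loss of quenching described above.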

When TAMRA is placed toward the 3'
end, there is a marked Mg2+ effect on
quenching. Figure 3 shows a plot of
observed RQ values for the A1 series of
probes as a function of Mg2+
concentration. With TAMRA attached near the 5'
end (probe A1-2 or A1-7), the RQ value at
0 mM Mg2+ is only slightly higher than
RQ at 10 mM Mg2+. For probes A1-19,
A1-22, and A1-26, the RQ values at 0 mM
Mg2+ are very high, indicating a much
reduced quenching efficiency. For each
of these probes, there is a marked
decrease in RQ at 1 mM Mg2+ followed by
a gradual decline as the Mg2+
concentration increases to 10 mM. Probe A1-14
shows an intermediate RQ value at 0 mM
Mg2+ with a gradual decline at higher
Mg2+ concentrations. In a low-salt
environment with no Mg2+ present, a
single-stranded oligonucleotide would be
expected to adopt an extended
conformation because of electrostatic
repulsion. The binding of Mg2+ ions acts to
shield the negative charge of the
phosphate backbone so that the
oligonucleotide can adopt conformations where
the 3' end is close to the 5' end.
Therefore, the observed Mg2+ effects support
the notion that quenching of a 5'
reporter dye by TAMRA at or near the 3'
end depends on the flexibility of the
oligonucleotide.

DISCUSSION 

The striking finding of this study is that
it seems the rhodamine dye TAMRA,
placed at any position in an
oligonucleotide, can quench the fluorescent
emission of a fluorescein (6-FAM) placed at
the 5' end. This implies that a
single-stranded, double-labeled
oligonucleotide must be able to adopt
conformations where the TAMRA is close to the 5'
end. It should be noted that the decay of
6-FAM in the excited state requires a
certain amount of time. Therefore, what



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and
Double-stranded Fluorogenic Probes

             518 nm            582 nm             RQ
Probe     ss       ds       ss       ds       ss     ds
A1-7     27.75    68.53    61.08   138.18    0.45   0.50
A1-26    43.31   509.38    53.50    93.86    0.81   5.43
A3-6     16.75    62.88    39.33   165.57    0.43   0.38
A3-24    30.05   578.64    67.72   140.25    0.45   3.21
P2-7     35.02    70.13    54.63   121.09    0.64   0.58
P2-27    39.89   320.47    65.10    61.13    0.61   5.25
P5-10    27.34   144.85    61.95   165.54    0.44   0.87
P5-28    33.65   462.29    72.39   104.61    0.46   4.43

(ss) Single-stranded. The fluorescence emissions at 518 or 582 nm for solutions containing a final
concentration of 50 nM indicated probe, 10 mM Tris-HCl (pH 8.3), 50 mM KCl, and 10 mM MgCl2.
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes A1-7 and
A1-26, 100 nM A3C for probes A3-6 and A3-24, 100 nM P2C for probes P2-7 and P2-27, or 100 nM
P5C for probes P5-10 and P5-28. Before the addition of MgCl2, 120 µl of each sample was heated
at 95°C for 5 min. Following the addition of 80 µl of 25 mM MgCl2, each sample was allowed to
cool to room temperature and the fluorescence emissions were measured. Reported values are
the average of three determinations.



matters for quenching is not the average
distance between 6-FAM and TAMRA
but, rather, how close TAMRA can get to
6-FAM during the lifetime of the 6-FAM
excited state. As long as the decay time of
the excited state is relatively long
compared with the molecular motions of the
oligonucleotide, quenching can occur.
Thus, we propose that TAMRA at the 3'
end, or any other position, can quench
6-FAM at the 5' end because TAMRA is in
proximity to 6-FAM often enough to be
able to accept energy transfer from an
excited 6-FAM.

Details of the fluorescence
measurements remain puzzling. For example,
Table 3 shows that hybridization of probes
A1-26, A3-24, and P5-28 to their
complementary strands not only causes a large
increase in 6-FAM fluorescence at 518
nm but also causes a modest increase in
TAMRA fluorescence at 582 nm. If
TAMRA is being excited by energy
transfer from quenched 6-FAM, then loss of
quenching attributable to hybridization
should cause a decrease in the
fluorescence emission of TAMRA. The fact that
the fluorescence emission of TAMRA
increases indicates that the situation is
more complex. For example, we have
anecdotal evidence that the bases of the
oligonucleotide, especially G, quench
the fluorescence of both 6-FAM and
TAMRA to some degree. When
double-stranded, base-pairing may reduce the
ability of the bases to quench. The
primary factor causing the quenching of
6-FAM in an intact probe is the TAMRA
dye. Evidence for the importance of
TAMRA is that 6-FAM fluorescence
remains relatively unchanged when
probes labeled only with 6-FAM are used
in the 5' nuclease PCR assay (data not
shown). Secondary effectors of
fluorescence, both before and after cleavage of
the probe, need to be explored further.

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the
value of RQ⁻, which is the ratio of reporter
to quencher fluorescent emissions
for a no template control PCR. Influences
on the value of RQ⁻ include
the particular reporter and quencher 
FIGURE 3 Effect of Mg2+ concentration on RQ ratio for the A1 series of probes. The fluorescence
emission intensity at 518 and 582 nm was measured for solutions containing 50 nM probe, 10 mM
Tris-HCl (pH 8.3), 50 mM KCl, and varying amounts (0-10 mM) of MgCl2. The calculated RQ
ratios (518 nm intensity divided by 582 nm intensity) are plotted vs. MgCl2 concentration (mM
Mg). The key (upper right) shows the probes examined.



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe Tm, presence of secondary structure
in probe or template, annealing
temperature, and other reaction condi- 
tions. The third factor is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter and quencher
dyes drastically reduce the cleavage of 
probe. (10)

The rise in RQ⁻ values for the A1 series
of probes seems to indicate that the
degree of quenching is reduced somewhat
as the quencher is placed toward
the 3' end. The lowest apparent quenching
is observed for probe A1-19 (see Fig.
3) rather than for the probe where the
TAMRA is at the 3' end (A1-26). This is
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a
quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ⁻ values
is less clear-cut. The A3 probes show the
same trend as A1, with the 3' TAMRA
probe having a larger RQ⁻ than the internal
TAMRA probe. For the P2 pair,
both probes have about the same RQ⁻
value. For the P5 probes, the RQ⁻ for the
3' probe is less than for the internally
labeled probe. Another factor that may
explain some of the observed variation is
that purity affects the RQ⁻ value. Although
all probes are HPLC purified, a
small amount of contamination with
unquenched reporter can have a large effect
on RQ⁻.

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe A1-2, where placement
of the TAMRA on the second nucleotide
reduces the efficiency of cleavage
to almost zero. For the A3, P2, and P5
probes, ΔRQ is much greater for the 3'
TAMRA probes as compared with the internal
TAMRA probes. This is explained
most easily by assuming that probes
with TAMRA at the 3' end are more likely
to be cleaved between reporter and
quencher than are probes with TAMRA
attached internally. For the A1 probes,
the cleavage efficiency of probe A1-7
must already be quite high, as ΔRQ does
not increase when the quencher is
placed closer to the 3' end. This illustrates
the importance of being able to
use probes with a quencher on the 3' 
end in the 5' nuclease PCR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (A1-7) and other times
not so well (A3-6). The relatively poor
performance of probe A3-6 presumably
means the probe is being cleaved 3' to
the quencher rather than between the
reporter and quencher. Therefore, the
best chance of having a probe that reliably
detects accumulation of PCR product
in the 5' nuclease PCR assay is to use
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the Tm
of a probe. In fact, a 2°C-3°C reduction
in Tm has been observed for two probes
with internally attached TAMRAs. (9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of increased cleavage
and hybridization efficiencies means
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This tolerance
of mismatches can be advantageous,
as when trying to use a single
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disadvantage
is for allelic discrimination. Lee
et al. (1) demonstrated that allele-specific
probes were cleaved between reporter
and quencher only when hybridized to a
perfectly complementary target. This allowed
them to distinguish the normal
human cystic fibrosis allele from the
ΔF508 mutant. Their probes had TAMRA
attached to the seventh nucleotide from
the 5' end and were designed so that any
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an internal
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybridization
assays for diagnostics or other applications.
Bagwell et al. (10) describe just
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR amplification.
We have developed a novel "real time" quantitative PCR method. The method measures PCR product
accumulation through a dual-labeled fluorogenic probe (i.e., TaqMan Probe). This method provides very
accurate and reproducible quantitation of gene copies. Unlike other quantitative PCR methods, real-time PCR
does not require post-PCR sample handling, preventing potential PCR product carry-over contamination and
resulting in much faster and higher throughput assays. The real-time PCR method has a very large dynamic
range of starting target molecule determination (at least five orders of magnitude). Real-time quantitative
PCR is extremely accurate and less labor-intensive than current quantitative PCR methods.



Quantitative nucleic acid sequence analysis has had an important role in many
fields of biological research. Measurement of gene expression (RNA) has been
used extensively in monitoring biological responses to various stimuli (Tan et al.
1991; Huang et al. 1995a,b; Prud'homme et al. 1995). Quantitative gene analysis
(DNA) has been used to determine the genome quantity of a particular gene, as in
the case of the human HER2 gene, which is amplified in ~30% of breast tumors
(Slamon et al. 1987). Gene and genome quantitation (DNA and RNA) also have
been used for analysis of human immunodeficiency virus (HIV) burden,
demonstrating changes in the levels of virus throughout the different phases of
the disease (Connor et al. 1993; Piatak et al. 1993b; Furtado et al. 1995).

Many methods have been described for the quantitative analysis of nucleic acid
sequences (both for RNA and DNA; Southern 1975; Sharp et al. 1980; Thomas
1980). Recently, PCR has proven to be a powerful tool for quantitative nucleic
acid analysis. PCR and reverse transcriptase (RT)-PCR have permitted the
analysis of minimal starting quantities of nucleic acid (as little as one cell
equivalent). This has made possible many experiments that could not have been
performed with traditional methods. Although PCR has provided a powerful tool,
it is imperative that it be used properly for quantitation (Raeymaekers 1995).
Many early reports of quantitative PCR and RT-PCR described quantitation of
the PCR product but did not measure the initial target sequence quantity. It is
essential to design proper controls for the quantitation of the initial target
sequences (Ferre 1992; Clementi et al. 1993).

Researchers have developed several methods of quantitative PCR and RT-PCR.
One approach measures PCR product quantity in the log phase of the reaction
before the plateau (Kellogg et al. 1990; Pang et al. 1990). This method requires
that each sample has equal input amounts of nucleic acid and that each sample
under analysis amplifies with identical efficiency up to the point of
quantitative analysis. A gene sequence (contained in all samples at relatively
constant quantities, such as β-actin) can be used for sample amplification
efficiency normalization. Using conventional methods of PCR detection and
quantitation (gel electrophoresis or plate capture hybridization), it is
extremely laborious to assure that all samples are analyzed during the log phase
of the reaction (for both the target gene and the normalization gene). Another
method, quantitative competitive (QC)-PCR, has been developed and is used
widely for PCR quantitation. QC-PCR relies on the inclusion of an internal
control competitor in each reaction (Becker-Andre 1991; Piatak et al. 1993a,b).
The efficiency of each reaction is normalized to the internal competitor. A
known amount of internal competitor can be added to each sample. To obtain
relative quantitation, the unknown target PCR product is compared with the
known competitor PCR product. Success of a quantitative competitive PCR assay
relies on developing an internal control that amplifies with the same
efficiency as the target molecule. The design of the competitor and the
validation of amplification efficiencies require a dedicated effort. However,
because QC-PCR does not require that PCR products be analyzed during the log
phase of the amplification, it is the easier of the two methods to use.

Several detection systems are used for quantitative PCR and RT-PCR analysis:
(1) agarose gels, (2) fluorescent labeling of PCR products and detection with
laser-induced fluorescence using capillary electrophoresis (Fasco et al. 1995;
Williams et al. 1996) or acrylamide gels, and (3) plate capture and sandwich
probe hybridization (Mulder et al. 1994). Although these methods proved
successful, each method requires post-PCR manipulations that add time to the
analysis and may lead to laboratory contamination. The sample throughput of
these methods is limited (with the exception of the plate capture approach),
and, therefore, these methods are not well suited for uses demanding high
sample throughput (i.e., screening of large numbers of biomolecules or
analyzing samples for diagnostics or clinical trials).

Here we report the development of a novel assay for quantitative DNA analysis.
The assay is based on the use of the 5' nuclease assay first described by
Holland et al. (1993). The method uses the 5' nuclease activity of Taq
polymerase to cleave a nonextendible hybridization probe during the extension
phase of PCR. The approach uses dual-labeled fluorogenic hybridization probes
(Lee et al. 1993; Bassler et al. 1995; Livak et al. 1995a,b). One fluorescent
dye serves as a reporter [FAM (i.e., 6-carboxyfluorescein)] and its emission
spectra is quenched by the second fluorescent dye, TAMRA (i.e.,
6-carboxy-tetramethyl-rhodamine). The nuclease degradation of the
hybridization probe releases the quenching of the FAM fluorescent emission,
resulting in an increase in peak fluorescent emission at 518 nm. The use of a
sequence detector (ABI Prism) allows measurement of fluorescent spectra of all
96 wells of the thermal cycler continuously during the PCR amplification.
Therefore, the reactions are monitored in real time. The output data is
described and quantitative analysis of input target DNA sequences is discussed
below.



RESULTS 



PCR Product Detection in Real Time

The goal was to develop a high-throughput, sensitive, and accurate gene
quantitation assay for use in monitoring lipid mediated therapeutic gene
delivery. A plasmid encoding human factor VIII gene sequence, pF8TM (see
Methods), was used as a model therapeutic gene. The assay uses fluorescent
Taqman methodology and an instrument capable of measuring fluorescence in real
time (ABI Prism 7700 Sequence Detector). The Taqman reaction requires a
hybridization probe labeled with two different fluorescent dyes. One dye is a
reporter dye (FAM), the other is a quenching dye (TAMRA). When the probe is
intact, fluorescent energy transfer occurs and the reporter dye fluorescent
emission is absorbed by the quenching dye (TAMRA). During the extension phase
of the PCR cycle, the fluorescent hybridization probe is cleaved by the 5'-3'
nucleolytic activity of the DNA polymerase. On cleavage of the probe, the
reporter dye emission is no longer transferred efficiently to the quenching
dye, resulting in an increase of the reporter dye fluorescent emission spectra.
PCR primers and probes were designed for the human factor VIII sequence and
human β-actin gene (as described in Methods). Optimization reactions were
performed to choose the appropriate probe and magnesium concentrations
yielding the highest intensity of reporter fluorescent signal without
sacrificing specificity. The instrument uses a charge-coupled device (i.e., CCD
camera) for measuring the fluorescent emission spectra from 500 to 650 nm. Each
PCR tube was monitored sequentially for 25 msec with continuous monitoring
throughout the amplification. Each tube was re-examined every 8.5 sec.
Computer software was designed to examine the fluorescent intensity of both
the reporter dye (FAM) and the quenching dye (TAMRA). The fluorescent
intensity of the quenching dye, TAMRA, changes very little over the course of
the PCR amplification (data not shown). Therefore, the intensity of TAMRA dye
emission serves as an internal standard with which to normalize the reporter
dye (FAM) emission variations. The software calculates a value termed ΔRn (or
ΔRQ) using the following equation: ΔRn = (Rn⁺) - (Rn⁻), where Rn⁺ = emission
intensity of reporter/emission intensity of quencher at any given time in a
reaction tube, and Rn⁻ = emission intensity of reporter/emission intensity of
quencher measured prior to PCR amplification in that same reaction tube. For
the purpose of quantitation, the last three data points (ΔRns) collected
during the extension step for each PCR cycle were analyzed. The nucleolytic
degradation of the hybridization probe occurs during the extension phase of
PCR, and, therefore, reporter fluorescent emission increases during this time.
The three data points were averaged for each PCR cycle and the mean value for
each was plotted in an "amplification plot" shown in Figure 1A. The ΔRn mean
value is plotted on the y-axis, and time, represented by cycle number, is
plotted on the x-axis. During the early cycles of the PCR amplification, the
ΔRn value remains at base line. When sufficient hybridization probe has been
cleaved by the Taq polymerase nuclease activity, the intensity of reporter
fluorescent emission increases. Most PCR amplifications reach a plateau phase
of reporter fluorescent emission if the reaction is carried out to high cycle
numbers. The amplification plot is examined early in the reaction, at a point
that represents the log phase of product accumulation. This is done by
assigning an arbitrary threshold that is based on the variability of the
base-line data. In Figure 1A, the threshold was set at 10 standard deviations
above the mean of base-line emission calculated from cycles 1 to 15. Once the
threshold is chosen, the point at which
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(*) The Mode! 7700 Mjflware will construct am plifi cat ioo plots 



PCR produci detection in real time. \~/ — .... Tl r *^ M *~4 a* 

from the extension phase fluorescent emission data collected during the PCR amplification. The standard de 
SI?! Llermined'irom the data points collected from the base line of the am pll *™J»*r ^ he 
calculated bv determining the point at which the fluorescence exceeds a threshold llmil (usually 10 times me 
of thcWSne). (8) Overlay of amplification plots of ^^«^"J 
DNA samples amplified with fJ-actin primers. <Q Input DNA concentration of lh»_»«ple| P^,^,^ 
the amplification plot crosses the threshold is defined as CT. CT is reported
as the cycle number at this point. As will be demonstrated, the CT value is
predictive of the quantity of input target.
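
The ΔRn and threshold-cycle definitions above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the sample standard deviation is assumed for the base-line variability, and linear interpolation between cycles is an assumption made here to obtain a fractional CT (the text says only that CT is the point where the plot crosses the threshold).

```python
from statistics import mean, stdev


def delta_rn(reporter, quencher, reporter_pre, quencher_pre):
    """dRn = Rn+ - Rn-: reporter/quencher ratio now minus the pre-PCR ratio."""
    return reporter / quencher - reporter_pre / quencher_pre


def baseline_threshold(delta_rn_by_cycle, baseline_cycles=15, n_sd=10.0):
    """Threshold = base-line mean + n_sd * base-line standard deviation.

    Defaults follow the text: cycles 1 to 15, 10 standard deviations.
    Using the sample standard deviation is an assumption.
    """
    baseline = delta_rn_by_cycle[:baseline_cycles]
    return mean(baseline) + n_sd * stdev(baseline)


def threshold_cycle(delta_rn_by_cycle, threshold):
    """Fractional cycle at which the plot first crosses the threshold.

    Index 0 of the list is cycle 1. Returns None if the threshold is
    never crossed. Linear interpolation is an illustrative assumption.
    """
    for i in range(1, len(delta_rn_by_cycle)):
        prev, cur = delta_rn_by_cycle[i - 1], delta_rn_by_cycle[i]
        if prev < threshold <= cur:
            # prev belongs to cycle i, cur to cycle i + 1
            return i + (threshold - prev) / (cur - prev)
    return None
```

In use, one would build the per-cycle ΔRn list from the averaged extension-phase readings, call `baseline_threshold` on it, and pass the result to `threshold_cycle`.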

CT Values Provide a Quantitative Measurement of
Input Target Sequences

Figure 1B shows amplification plots of 15 different PCR amplifications
overlaid. The amplifications were performed on a 1:2 serial dilution of human
genomic DNA. The amplified target was human β-actin. The amplification plots
shift to the right (to higher threshold cycles) as the input target quantity is
reduced. This is expected because reactions with fewer starting copies of the
target molecule require greater amplification to degrade enough probe to attain
the threshold fluorescence. An arbitrary threshold of 10 standard deviations
above the base line was used to determine the CT values. Figure 1C represents
the CT values plotted versus the sample dilution value. Each dilution was
amplified in triplicate PCR amplifications and plotted as mean values with
error bars representing one standard deviation. The CT values decrease linearly
with increasing target quantity. Thus, CT values can be used as a quantitative
measurement of the input target number. It should be noted that the
amplification plot for the 15.6-ng sample shown in Figure 1B does not reflect
the same fluorescent rate of increase exhibited by most of the other samples.
The 15.6-ng sample also achieves endpoint plateau at a lower fluorescent value
than would be expected based on the input DNA. This phenomenon has been
observed occasionally with other samples (data not shown) and may be
attributable to late cycle inhibition; this hypothesis is still under
investigation. It is important to note that the flattened slope and early
plateau do not impact significantly the calculated CT value, as demonstrated by
the fit on the line shown in Figure 1C. All triplicate amplifications resulted
in very similar CT values; the standard deviation did not exceed 0.5 for any
dilution. This experiment contains a >100,000-fold range of input target
molecules. Using CT values for quantitation permits a much larger assay range
than directly using total fluorescent emission intensity for quantitation. The
linear range of fluorescent intensity measurement of the ABI Prism 7700
Sequence [...] over a very large range of relative starting
target quantities.
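
A linear relation between CT and input quantity can be turned into a standard curve. The sketch below fits CT against log10 of the input quantity by ordinary least squares and reads an unknown sample off the fitted line; the log10 scale, the function names, and the example numbers used with it are assumptions for illustration, not values from the study.

```python
import math


def fit_standard_curve(quantities, cts):
    """Least-squares fit of CT = slope * log10(quantity) + intercept."""
    xs = [math.log10(q) for q in quantities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def quantity_from_ct(ct, slope, intercept):
    """Invert the fitted line to estimate the input quantity for a measured CT."""
    return 10 ** ((ct - intercept) / slope)


def amplification_per_cycle(slope):
    """Fold increase per cycle implied by the slope (2.0 means perfect doubling)."""
    return 10 ** (-1.0 / slope)
```

With an idealized 1:2 dilution series in which each dilution step costs exactly one cycle, the fitted slope is -1/log10(2), about -3.32 cycles per tenfold change, and `amplification_per_cycle` returns 2.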

Sample Preparation Validation 

Several parameters influence the efficiency of PCR amplification: magnesium and
salt concentrations, reaction conditions (i.e., time and temperature), PCR
target size and composition, primer sequences, and sample purity. All of the
above factors are common to a single PCR assay, except sample to sample purity.
In an effort to validate the method of sample preparation for the factor VIII
assay, PCR amplification reproducibility and efficiency of 10 replicate sample
preparations were examined. After genomic DNA was prepared from the 10
replicate samples, the DNA was quantitated by ultraviolet spectroscopy.
Amplifications were performed analyzing β-actin gene content in 100 and 25 ng
of total genomic DNA. Each PCR amplification was performed in triplicate.
Comparison of CT values for each triplicate sample shows minimal variation
based on standard deviation and coefficient of variance (Table 1). Therefore,
each of the triplicate PCR amplifications was highly reproducible,
demonstrating that real time PCR using this instrumentation introduces minimal
variation into the quantitative PCR analysis. Comparison of the mean CT values
of the 10 replicate sample preparations also showed minimal variability,
indicating that each sample preparation yielded similar results for β-actin
gene quantity. The highest CT difference between any of the samples was 0.85
and 0.71 for the 100 and 25 ng samples, respectively. Additionally, the
amplification of each sample exhibited an equivalent rate of fluorescent
emission intensity change per amount of DNA target analyzed, as indicated by
similar slopes derived from the sample dilutions (Fig. 2). Any sample
containing an excess of a PCR inhibitor would exhibit a greater measured
β-actin CT value for a given quantity of DNA. In addition, the inhibitor would
be diluted along with the sample in the dilution analysis (Fig. 2), altering
the expected CT value change. Each sample amplification yielded a similar
result in the analysis, demonstrating that this method of sample preparation is
highly reproducible with regard to sample purity.
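
The per-sample statistics used in this validation (mean, standard deviation, and coefficient of variance of triplicate CT values) can be computed as in the sketch below. The function name is hypothetical, and the use of the sample standard deviation with CV expressed as a percentage is an assumption; it gives values close to those tabulated for the triplicate 18.54, 18.67, 19 (mean 18.74, standard deviation 0.24).

```python
from statistics import mean, stdev


def triplicate_stats(cts):
    """Return (mean, sample standard deviation, CV in percent) for replicate CT values."""
    m = mean(cts)
    sd = stdev(cts)
    cv_percent = 100.0 * sd / m
    return m, sd, cv_percent


if __name__ == "__main__":
    m, sd, cv = triplicate_stats([18.54, 18.67, 19.0])
    print(f"mean = {m:.2f}, SD = {sd:.2f}, CV = {cv:.2f}%")
```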

Quantitative Analysis of a Plasmid After
Table 1. Reproducibility of Sample Preparation Method

100 ng
Sample                                      standard
no.        CT (triplicate)          mean    deviation    CV
1          18.24  18.23  18.33      18.27     0.06      0.32
2          18.33  18.35  18.44      18.37     0.06      0.32
3          18.3   18.3   18.42      18.34     0.07      0.36
4          18.15  18.23  18.32      18.23     0.08      0.46
5          18.4   18.38  18.46      18.42     0.04      0.23
6          18.54  18.67  19         18.74     0.24      1.26
7          18.28  18.36  18.52      18.39     0.12      0.66
8          18.45  18.7   18.73      18.63     0.16      0.83
9          18.18  18.34  18.26      18.29     0.1       0.54
10         18.42  18.57  18.66      18.55     0.12      0.65
Mean (1-10)                         18.42     0.17      0.90

25 ng
Sample                                      standard
no.        CT (triplicate)          mean    deviation    CV
1          20.48  20.55  20.5       20.51     0.03      0.17
2          20.61  20.59  20.41      20.54     0.11      0.54
3          20.54  20.6   20.49      20.54     0.06      0.28
4          20.48  20.44  20.38      20.43     0.05      0.26
5          20.68  20.87  20.63      20.73     0.13      0.61
6          21.09  21.04  21.04      21.06     0.03      0.15
7          20.67  20.73  20.65      20.68     0.04      0.2
8          20.96  20.84  20.75      20.86     0.12      0.57
9          20.46  20.54  20.48      20.51     0.07      0.32
10         20.79  20.78  20.62      20.73     0.1       0.46
Mean (1-10)                         20.66     0.19      0.94



[...] vector containing a partial cDNA for human factor VIII, pF8TM. A series
of transfections was set up using a decreasing amount of the plasmid (40, 4,
0.5, and 0.1 µg). Twenty-four hours post-transfection, total DNA was purified
from each flask of cells. β-Actin gene quantity was chosen as a value for
normalization of genomic DNA concentration from each sample. In this
experiment, β-actin gene content should remain constant relative to total
genomic DNA. Figure 3 shows the result of the β-actin DNA measurement (100 ng
total DNA determined by ultraviolet spectroscopy) of each sample. Each sample
was analyzed in triplicate and the mean β-actin CT values of the triplicates
were plotted (error bars represent one standard deviation). The highest
difference between any two sample means was 0.95 CT. Ten nanograms of total
DNA of each sample were also examined for β-actin. The results again showed
that very similar amounts of genomic DNA were present; the maximum mean
β-actin CT value difference was 1.0. As Figure 3 shows, the rate of β-actin CT
change between the 100 and 10-ng samples was similar (slope values range
between -3.56 and -3.45). This verifies again that the method of sample
preparation yields samples of identical PCR integrity (i.e., no sample
contained an excessive amount of a PCR inhibitor). However, these results
indicate that each sample contained slight differences in the actual amount of
genomic DNA analyzed. Determination of actual genomic DNA concentration was
accomplished
Figure 2 Sample preparation purity. The replicate samples shown in Table 1
were also amplified in triplicate using 25 ng of each DNA sample. The figure
shows the input DNA concentration (100 and 25 ng) vs. CT. In the figure, the
100 and 25 ng points for each sample are connected by a line.



by plotting the mean β-actin CT value obtained for each 100-ng sample on a
β-actin standard curve (shown in Fig. 4C). The actual genomic DNA
concentration of each sample, a, was obtained by extrapolation to the x-axis.

Figure 4A shows the measured (i.e., non-normalized) quantities of factor VIII
plasmid DNA (pF8TM) from each of the four transient cell transfections. Each
reaction contained 100 ng of total sample DNA (as determined by UV
spectroscopy). Each sample was analyzed in triplicate
Figure 3 Analysis of transfected cell DNA quantity and purity. The DNA
preparations of the four 293 cell transfections (40, 4, 0.5, and 0.1 µg of
pF8TM) were analyzed for the β-actin gene. 100 and 10 ng (determined by
ultraviolet spectroscopy) of each sample were amplified in triplicate. For
each amount of pF8TM that was transfected, the β-actin CT values are plotted
versus the total input DNA



PCR amplifications. As shown, pF8TM purified from the 293 cells decreases
(mean CT values increase) with decreasing amounts of plasmid transfected. The
mean CT values obtained for pF8TM in Figure 4A were plotted on a standard
curve comprised of serially diluted pF8TM, shown in Figure 4B. The quantity of
pF8TM, b, found in each of the four transfections was determined by
extrapolation to the x-axis of the standard curve in Figure 4B. These
uncorrected values, b, for pF8TM were normalized to determine the actual
amount of pF8TM found per 100 ng of genomic DNA by using the equation:

    (b × 100 ng) / a = actual pF8TM copies per 100 ng of genomic DNA

where a = actual genomic DNA in a sample and b = pF8TM copies from the
standard curve. The normalized quantity of pF8TM per 100 ng of genomic DNA for
each of the four transfections is shown in Figure 4D. These results show that
the quantity of factor VIII plasmid associated with the 293 cells, 24 hr after
transfection, decreases with decreasing plasmid concentration used in the
transfection. The quantity of pF8TM associated with 293 cells, after
transfection with 40 µg of plasmid, was 35 pg per 100 ng genomic DNA. This
results in ~520 plasmid copies per cell.



DISCUSSION 

We have described a new method for quantitating gene copy numbers using
real-time analysis of PCR amplifications. Real-time PCR is compatible with
either of the two PCR (RT-PCR) approaches: (1) quantitative competitive, where
an internal competitor for each target sequence is used for normalization
(data not shown), or (2) quantitative comparative PCR using a normalization
gene contained within the sample (i.e., β-actin) or a "housekeeping" gene for
RT-PCR. If equal amounts of nucleic acid are analyzed for each sample and if
the amplification efficiency before quantitative analysis is identical for each
sample, the internal control (normalization gene or competitor) should give
equal signals for all samples.

The real-time PCR method offers several advantages over the other two methods
currently employed (see the Introduction). First, the real-time PCR method is
performed in a closed-tube system and requires no post-PCR manipulation
Figure 4 Quantitative analysis of pF8TM in transfected cells. (A) Amount of
plasmid DNA used for the transfection plotted against the mean CT value
determined for pF8TM remaining 24 hr after transfection. (B,C) Standard curves
of pF8TM and β-actin, respectively. pF8TM DNA (B) and genomic DNA (C) were
diluted serially 1:5 before amplification with the appropriate primers. The
β-actin standard curve was used to normalize the results of A to 100 ng of
genomic DNA. (D) The amount of pF8TM present per 100 ng of genomic DNA.



of .wmplv. Therefore, I hi- potential for PCR con- 
tamination in the labc >r;itc.»ry is reduced because 
Amplified products can be analysed and disposed 
of without opening the reaction lubes. Second, 
this method suppoUs the umi uf a iiuruiulixaUoi] 
gene (Lc,, p-actin) for quantitative PCR or house- 
keeping genes for quantitative RT-l'Ck controls. 
Analysis is performed in real time during the Jog 
phase of product accumulation. Analysis during 
lug phase permits many different genes (over a 
wide input target range) to be analysed simulta- 
neously, without concern of reaching rend ion 
plateau at different cycles, This will make niulll- 
gene analysis assays much culisi lu develop, be- 
cause individual internal i.ninpclUoi> will nui be 
needed for each gene under analysis. Third, 
sample throughput will inciease dramatically 
with the new method because, there is no post- 
1*CK processing time. Additionally, woiking In a 
96-well format is highly compatible with auto* 
inalion technology, 

The real-time PCR method is highly reproducible.
Replicate amplifications can be analyzed
for each sample, minimizing potential error. The
system allows for a very large assay dynamic
range (approaching 1,000,000-fold starting target).
Using a standard curve for the target of interest,
relative copy number values can be determined
for any unknown sample. Fluorescent
threshold values, C_T, correlate linearly with relative
DNA copy numbers. Real time quantitative
RT-PCR methodology (Gibson et al., this issue)
has also been developed. Finally, real time
quantitative PCR methodology can be used to develop
high-throughput screening assays for a variety of
applications [quantitative gene expression
(RT-PCR), gene copy assays (Her2, HIV, etc.),
genotyping (knockout mouse analysis), and
immuno-PCR].
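The standard-curve step described above can be illustrated with a short Python sketch. It assumes, as is usual for this kind of assay, that C_T is linear in the logarithm of the starting copy number; the function names and the dilution values are hypothetical and are not data from this study.

```python
import math


def fit_standard_curve(copies, ct_values):
    """Least-squares fit of C_T = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the fitted line to estimate copies in an unknown sample."""
    return 10 ** ((ct - intercept) / slope)


# Hypothetical 10-fold dilution series of a plasmid standard.
standard_copies = [1e2, 1e3, 1e4, 1e5]
standard_ct = [33.4, 30.1, 26.8, 23.5]

slope, intercept = fit_standard_curve(standard_copies, standard_ct)
unknown_copies = copies_from_ct(28.45, slope, intercept)
```

A closed-form least-squares fit is used here so the sketch needs only the standard library; any regression routine would serve equally well.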

Real-time PCR may also be performed using
intercalating dyes (Higuchi et al.) such as
ethidium bromide. The fluorogenic probe
method offers a major advantage over
intercalating dyes: greater specificity (i.e., primer
dimers and nonspecific PCR products are not
detected).

METHODS

Generation of a Plasmid Containing a Partial
cDNA for Human Factor VIII

Total RNA was harvested (RNAzol B from Tel-Test, Inc.,
Friendswood, TX) from cells transfected with a factor VIII
expression vector, pCIS2.8c25D (Eaton et al. 1986; Gorman
et al. 1990). A factor VIII partial cDNA sequence was
generated by RT-PCR [GeneAmp EZ rTth RNA PCR Kit
(part N808-0179, PE Applied Biosystems, Foster City, CA)]
using the PCR primers F8for and F8rev (primer sequences
are shown below). The amplicon was reamplified using
modified F8for and F8rev primers (appended with BamHI
and HindIII restriction site sequences at the 5' end) and
cloned into pGEM-3Z (Promega Corp., Madison, WI). The
resulting clone, pF8TM, was used for transient transfection
of 293 cells.



Amplification of Target DNA and Detection of
Amplicon

Factor VIII plasmid DNA (pF8TM) was amplified with the
primers F8for 5'-[sequence illegible]-3' and F8rev
5'-[sequence illegible]-3'. The reaction produced a 422-bp
PCR product. The forward primer was designed to
recognize a unique sequence found in the 5' untranslated
region of the parent pCIS2.8c25D plasmid and therefore
does not recognize and amplify the human factor VIII
gene. Primers were chosen with the assistance of the
computer program Oligo 4.0 (National Biosciences, Inc.,
Plymouth, MN). The human β-actin gene was amplified with
the primers β-actin forward primer
5'-TCACCCACACTGTGCCCATCTACGA-3' and β-actin reverse primer
5'-CAGCGGAACCGCTCATTGCCAATGG-3'. The reaction
produced a 295-bp PCR product.

Amplification reactions (50 µl) contained a DNA
sample, 10× PCR Buffer II (5 µl), 200 µM dATP, dCTP,
dGTP, and 400 µM dUTP, 4 mM MgCl2, 1.25 Units AmpliTaq
DNA polymerase, 0.5 unit AmpErase uracil N-glycosylase
(UNG), 60 pmole of each factor VIII primer, and 15
pmole of each β-actin primer. The reactions also contained
one of the following detection probes (100 nM each):
F8probe 5'-(FAM)AGCTCTCCACCTGCTTCTTTCTGTGCCTT(TAMRA)p-3'
and β-actin probe 5'-(FAM)ATGCCC-X(TAMRA)-CCCCCATGCCATCp-3',
where p indicates phosphorylation and X indicates a linker
arm nucleotide. Reaction tubes were MicroAmp Optical Tubes
(part number N801-0933, Perkin Elmer) that were frosted (at
Perkin Elmer) to prevent light from reflecting. Tube caps were
similar to MicroAmp Caps but specially designed to prevent
light scattering. All of the PCR consumables were supplied
by PE Applied Biosystems (Foster City, CA) except
the factor VIII primers, which were synthesized at
Genentech, Inc. (South San Francisco, CA). Probes were designed
using the Oligo 4.0 software, following guidelines
suggested in the Model 7700 Sequence Detector instrument
manual. Briefly, probe Tm should be at least 5°C higher
than the annealing temperature used during thermal
cycling; primers should not form stable duplexes with the
probe.

The thermal cycling conditions included 2 min at
50°C and 10 min at 95°C. Thermal cycling proceeded with



reactions were performed in the Model 7700 Sequence
Detector (PE Applied Biosystems), which contains a
GeneAmp PCR System 9600. Reaction conditions were
programmed on a Power Macintosh 7100 (Apple Computer,
Santa Clara, CA) linked directly to the Model 7700
Sequence Detector. Analysis of data was also performed on
the Macintosh computer. Collection and analysis software
was developed at PE Applied Biosystems.

Transfection of Cells with Factor VIII Construct

Four T175 flasks of 293 cells (ATCC CRL 1573), a human
fetal kidney suspension cell line, were grown to 80%
confluency and transfected with pF8TM. Cells were grown in the
following media: 50% HAM'S F12 without GHT, 50% low
glucose Dulbecco's modified Eagle medium (DMEM) without
glycine with sodium bicarbonate, 10% fetal bovine
serum, 2 mM L-glutamine, and 1% penicillin-streptomycin.
The media was changed 30 min before the transfection.
pF8TM DNA amounts of 40, 4, 0.5, and 0.1 µg were
added to 1.5 ml of a solution containing 0.125 M CaCl2
and 1× HEPES. The four mixtures were left at room
temperature for 10 min and then added dropwise to the cells.
The flasks were incubated at 37°C and 5% CO2 for 24 hr,
washed with PBS, and resuspended in PBS. The
resuspended cells were divided into aliquots and DNA was
extracted immediately using the QIAamp Blood Kit (Qiagen,
Chatsworth, CA). DNA was eluted into 200 µl of 30 mM
Tris-HCl at pH 8.0.

ABSTRACT Wnt family members are critical to many
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1
retroviral vector or expressing Wnt-1 under the control of a
tetracycline-repressible promoter, and (ii) Wnt-1 transgenic
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24.3. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20q12-20q13
and its DNA was amplified, but RNA expression was
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling
by binding to the seven-transmembrane spanning Frizzled
receptors and recruiting the cytoplasmic protein Dishevelled
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the
kinase activity of the normally constitutively active glycogen
synthase kinase-3β (GSK-3β), resulting in an increase in
β-catenin levels. Stabilized β-catenin interacts with the
transcription factor TCF/Lef1, forming a complex that appears in
the nucleus and binds TCF/Lef1 target DNA elements to
activate transcription (7, 8). Other experiments suggest that
the adenomatous polyposis coli (APC) tumor suppressor gene
also plays an important role in Wnt signaling by regulating
β-catenin levels (9). APC is phosphorylated by GSK-3β, binds
to β-catenin, and facilitates its degradation. Mutations in
either APC or β-catenin have been associated with colon
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-β superfamily, and the
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois
(2). A recent report also identifies c-myc as a target gene of the
Wnt signaling pathway (10).

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1
and WISP-2, and a third related gene, WISP-3. The WISP genes
are members of the CCN family of growth factors, which
includes connective tissue growth factor (CTGF), Cyr61, and
nov, a family not previously linked to Wnt signaling.

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA
Subtraction Kit (CLONTECH). Tester double-stranded
cDNA was synthesized from 2 µg of poly(A)+ RNA isolated
from the C57MG/Wnt-1 cell line and driver cDNA from 2 µg
of poly(A)+ RNA from the parent C57MG cells. The
subtracted cDNA library was subcloned into a pGEM-T vector for
further analysis.

Abbreviations: TGF, transforming growth factor; CTGF, connective
tissue growth factor; SSH, suppression subtractive hybridization;
VWC, von Willebrand factor type C module.
Data deposition: The sequences reported in this paper have been
deposited in the GenBank database (accession nos. AF100777,
AF100778, AF100779, AF100780, and AF100781).
To whom reprint requests should be addressed. e-mail: diane@gene.com.

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a λgt10 mouse
embryo cDNA library (CLONTECH) with a 70-bp probe from
the original partial clone 568 sequence corresponding to amino
acids 128-169. Clones encoding full-length human WISP-1
were isolated by screening λgt10 lung and fetal kidney cDNA
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 µM of each dNTP at
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33P-labeled sense and antisense
riboprobes were transcribed from an 897-bp PCR product
corresponding to nucleotides 601-1440 of mouse WISP-1 or a
294-bp PCR product corresponding to nucleotides 82-375 of
mouse WISP-2. All tissues were processed as described (40).

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation fluorimetry. Total RNA was prepared
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula
$2^{\Delta C_t}$, where $\Delta C_t$ represents the difference in amplification
cycles required to detect the WISP genes in peripheral blood
lymphocyte DNA compared with colon tumor DNA or colon
tumor RNA compared with normal mucosal RNA. The
δ-method was used for calculation of the SE of the gene copy
number or RNA expression level. The WISP-specific signal was
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 
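The calculation described in this section can be sketched as follows. The sketch rests on two assumptions that the text does not spell out: that normalization to the housekeeping gene is done by subtracting its cycle value from the target's before the reference and test samples are compared, and that the SE is propagated with the usual first-order δ-method. Function and variable names are hypothetical, and the cycle values are invented for illustration.

```python
import math


def relative_level(ct_ref_target, ct_ref_gapdh, ct_test_target, ct_test_gapdh):
    """Return 2**dCt for a test sample relative to a reference sample.

    Each target Ct is first normalized to the housekeeping gene (GAPDH)
    measured in the same sample, then the reference and test samples are
    compared. A value above 1 means more target in the test sample.
    """
    delta_ct = (ct_ref_target - ct_ref_gapdh) - (ct_test_target - ct_test_gapdh)
    return 2.0 ** delta_ct


def delta_method_se(delta_ct, se_delta_ct):
    """First-order delta-method SE of 2**dCt, given the SE of dCt.

    Uses d/dx 2**x = ln(2) * 2**x.
    """
    return math.log(2.0) * (2.0 ** delta_ct) * se_delta_ct


# Hypothetical example: the target is detected one cycle earlier in tumor
# DNA than in normal DNA, with identical GAPDH signals in both samples.
fold = relative_level(25.0, 20.0, 24.0, 20.0)
fold_se = delta_method_se(1.0, 0.1)
```

Here a one-cycle difference corresponds to a 2-fold relative level, which is the relationship the formula encodes.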

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify
Wnt-1-inducible genes, we used the technique of SSH using the
mouse mammary epithelial cell line C57MG and C57MG cells
that stably express Wnt-1 (11). Candidate differentially
expressed cDNAs (1,384 total) were sequenced. Thirty-nine
percent of the sequences matched known genes or
homologues, 32% matched expressed sequence tags, and 29% had
no match. To confirm that the transcript was differentially
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially
expressed, being induced in the C57MG/Wnt-1 cell line, but
not in the parent C57MG cells or C57MG cells overexpressing
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce
the morphological transformation of C57MG cells and has no
effect on β-catenin levels (13, 14). Expression of WISP-1 was
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell
line and WISP-2 by approximately 5-fold by both Northern
analysis and reverse transcription-PCR.

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ~40,000 (Mr 40 K). Both have
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 2A). 

Full-length cDNA clones of mouse and human WISP-2 were
1,734 and 1,293 bp in length, respectively, and encode proteins
of 251 and 250 aa, respectively, with predicted relative
molecular masses of ~27,000 (Mr 27 K) (Fig. 2B). Mouse and human
WISP-2 are 73% identical. Human WISP-2 has no potential
N-linked glycosylation sites, and mouse WISP-2 has one at
position 197. WISP-2 has 28 cysteine residues that are
conserved among the 38 cysteines found in WISP-1.

Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4,
expression in C57MG cells. Northern analysis of WISP-1 (A) and
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/Wnt-4
cells. Poly(A)+ RNA (2 µg) was subjected to Northern blot
analysis and hybridized with a 70-bp mouse WISP-1-specific probe
(amino acids 278-300) or a 190-bp WISP-2-specific probe (nucleotides
1438-1627) in the 3' untranslated region. Blots were rehybridized with
human β-actin probe.

Fig. 2. Encoded amino acid sequence alignment of mouse and
human WISP-1 (A) and mouse and human WISP-2 (B). The potential
signal sequence, insulin-like growth factor-binding protein (IGF-BP),
VWC, thrombospondin (TSP), and C-terminal (CT) domains are
underlined.

Identification of WISP-3. To search for related proteins, we
screened expressed sequence tag (EST) databases with the
WISP-1 protein sequence and identified several ESTs as
potentially related sequences. We identified a homologous
protein that we have called WISP-3. A full-length human
WISP-3 cDNA of 1,371 bp was isolated corresponding to those
ESTs that encode a 354-aa protein with a predicted molecular
mass of 39,293. WISP-3 has two potential N-linked
glycosylation sites and 36 cysteine residues. An alignment of the three
human WISP proteins shows that WISP-1 and WISP-3 are the
most similar (42% identity), whereas WISP-2 has 37% identity
with WISP-1 and 32% identity with WISP-3 (Fig. 3A).

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences;
however, mouse WISP-1 is the same as the recently identified
Elm1 gene. Elm1 is expressed in low, but not high, metastatic
mouse melanoma cells, and suppresses the in vivo growth and
metastatic potential of K-1735 mouse melanoma cells (15).
Human and mouse WISP-2 are homologous to the recently
described rat gene, rCop-1 (16). Significant homology (36-44%)
was seen to the CCN family of growth factors. This family
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-β (17). Cyr61 is an extracellular
matrix signaling molecule that promotes cell adhesion,
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

Fig. 3. (A) Encoded amino acid sequence alignment of human
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not
present in WISP-3 are indicated with a dot. (B) Schematic
representation of the WISP proteins showing the domain structure and cysteine
residues (vertical lines). The four cysteine residues in the VWC domain
that are absent in WISP-3 are indicated with a dot. (C) Expression of
WISP mRNA in human tissues. PCR was performed on human
multiple-tissue cDNA panels (CLONTECH) from the indicated adult
and fetal tissues.

WISP proteins exhibit the modular architecture of the CCN
family, characterized by four conserved cysteine-rich domains
(Fig. 3B) (21). The N-terminal domain, which includes the first
12 cysteine residues, contains a consensus sequence (GCGCCXXC)
conserved in most insulin-like growth factor (IGF)-binding
proteins (BP). This sequence is conserved in WISP-2
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues.
Tissue-specific expression of human WISPs was characterized by PCR
analysis on adult and fetal multiple tissue cDNA panels.
WISP-1 expression was seen in the adult heart, kidney, lung,
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C).
Little or no expression was detected in the brain, liver, skeletal
muscle, colon, peripheral blood leukocytes, prostate, testis, or
thymus. WISP-2 had a more restricted tissue expression and
was detected in adult skeletal muscle, colon, ovary, and fetal
lung. Predominant expression of WISP-3 was seen in adult
kidney and testis and fetal kidney. Lower levels of WISP-3
expression were detected in placenta, ovary, prostate, and
small intestine.

In Situ Localization of WISP-1 and WISP-2. Expression of
WISP-1 and WISP-2 was assessed by in situ hybridization in
mammary tumors from Wnt-1 transgenic mice. Strong
expression of WISP-1 was observed in stromal fibroblasts lying within
the fibrovascular tumor stroma (Fig. 4 A-D). However,
low-level WISP-1 expression also was observed focally within tumor
cells (data not shown). No expression was observed in normal
breast. Like WISP-1, WISP-2 expression also was seen in the
tumor stroma in breast tumors from Wnt-1 transgenic animals
(Fig. 4 E-H). However, WISP-2 expression in the stroma was
in spindle-shaped cells adjacent to capillary vessels, whereas
the predominant cell type expressing WISP-1 was the stromal
fibroblasts.

Fig. 4. (A, C, E, and G) Representative hematoxylin/eosin-stained
images from breast tumors in Wnt-1 transgenic mice. The corresponding
dark-field images showing WISP-1 expression are shown in B and
D. The tumor is a moderately well-differentiated adenocarcinoma
showing evidence of adenoid cystic change. At low power (A and B),
expression of WISP-1 is seen in the delicate branching fibrovascular
tumor stroma (arrowhead). At higher magnification, expression is seen
in the stromal (s) fibroblasts (C and D), and tumor cells are negative.
Focal expression of WISP-1, however, was observed in tumor cells in
some areas. Images of WISP-2 expression are shown in E-H. At low
power (E and F), expression of WISP-2 is seen in cells lying within the
fibrovascular tumor stroma. At higher magnification, these cells
appeared to be adjacent to capillary vessels whereas tumor cells are
negative (G and H).

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on
chromosome 20q12-20q13.1. Human WISP-3 mapped to
chromosome 6q22-6q23 and is linked to the marker AFM211ze5
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to
CTGF and 23 Mbs proximal to the human cellular oncogene
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis (Fig.
5 A and B). Both methods detected similar degrees of WISP-1
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISP-3 was indistinguishable from one (P =
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19
adenocarcinomas and their matched normal mucosa were
assessed by quantitative PCR (Fig. 7). The level of WISP-1
RNA present in tumor tissue varied but was significantly
increased (2- to >25-fold) in 84% (16/19) of the human colon
tumors examined compared with normal adjacent mucosa.
Four of 19 tumors showed greater than 10-fold overexpression.
In contrast, in 79% (15/19) of the tumors examined, WISP-2
RNA expression was significantly lower in the tumor than the
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in
63% (12/19) of the colon tumors compared with the normal
mucosa. The amount of overexpression of WISP-3 ranged from
4- to >40-fold.

Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell
lines. (A) Amplification in cell line DNA was determined by
quantitative PCR. (B) Southern blots containing genomic DNA (10 µg)
digested with EcoRI (WISP-1) or XbaI (c-myc) were hybridized with
a 100-bp human WISP-1 probe (amino acids 186-219) or a human
c-myc probe (located at bp 1901-2000). The WISP and myc genes are
detected in normal human genomic DNA after a longer film exposure.

Fig. 6. Genomic amplification of WISP genes in human colon
tumors. The relative gene copy number of the WISP genes in 25
adenocarcinomas was assayed by quantitative PCR, by comparing
DNA from primary human tumors with pooled DNA from 10 healthy
donors. The data are means ± SEM from one experiment done in
triplicate. The experiment was repeated at least three times.

Fig. 7. WISP RNA expression in primary human colon tumors
relative to expression in normal mucosa from the same patient.
Expression of WISP mRNA in 19 adenocarcinomas was assayed by
quantitative PCR. The Dukes stage of the tumor is listed under the
sample number. The data are means ± SEM from one experiment
done in triplicate. The experiment was repeated at least twice.



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracycline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., β-catenin-TCF-1/Lef1). The increased levels of 
WISP RNA were measured in Wnt-1-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through β-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-β, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin αvβ3 serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-β1, which is the stimulus for 
stromal proliferation (34). TGF-β1 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20q12-20q13, at a region 
frequently amplified and associated with poor prognosis in 
node-negative breast cancer and many colon cancers, suggesting 
the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20q13 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and β-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic β-catenin, which presumably contributes to human 
carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 

We thank the DNA synthesis group for oligonucleotide synthesis, T. 
Baker for technical assistance, P. Dowd for radiation hybrid mapping, 
K. Willert and R. Nusse for the tet-repressible C57MG/Wnt-1 cells, V. 
Dixit for discussions, and D. Wood and A. Bruce for artwork. 

1. Cadigan, K. M. & Nusse, R. (1997) Genes Dev. 11, 3286-3305. 

2. Dale, T. C. (1998) Biochem. J. 329, 209-223. 

3. Nusse, R. & Varmus, H. E. (1982) Cell 31, 99-109. 

4. van Ooyen, A. & Nusse, R. (1984) Cell 39, 233-240. 

5. Tsukamoto, A. S., Grosschedl, R., Guzman, R. C., Parslow, T. & 
Varmus, H. E. (1988) Cell 55, 619-625. 

6. Brown, J. D. & Moon, R. T. (1998) Curr. Opin. Cell. Biol. 10, 
182-187. 

7. Molenaar, M., van de Wetering, M., Oosterwegel, M., Peterson- 
Maduro, J., Godsave, S., Korinek, V., Roose, J., Destree, O. & 
Clevers, H. (1996) Cell 86, 391-399. 



Proc. Natl. Acad. Sci. USA 95 (1998) 

8. Korinek, V., Barker, N., Willert, K., Molenaar, M., Roose, J., 
Wagenaar, G., Markman, M., Lamers, W., Destree, O. & Clevers, 
H. (1998) Mol. Cell. Biol. 18, 1248-1256. 

9. Munemitsu, S., Albert, I., Souza, B., Rubinfeld, B. & Polakis, P. 
(1995) Proc. Natl. Acad. Sci. USA 92, 3046-3050. 

10. He, T. C., Sparks, A. B., Rago, C., Hermeking, H., Zawel, L., da 
Costa, L. T., Morin, P. J., Vogelstein, B. & Kinzler, K. W. (1998) 
Science 281, 1509-1512. 

11. Diatchenko, L., Lau, Y. F., Campbell, A. P., Chenchik, A., 
Moqadam, F., Huang, B., Lukyanov, S., Lukyanov, K., Gurskaya, 
N., Sverdlov, E. D. & Siebert, P. D. (1996) Proc. Natl. Acad. Sci. 
USA 93, 6025-6030. 

12. Brown, A. M., Wildin, R. S., Prendergast, T. J. & Varmus, H. E. 
(1986) Cell 46, 1001-1009. 

13. Wong, G. T., Gavin, B. J. & McMahon, A. P. (1994) Mol. Cell. 
Biol. 14, 6278-6286. 

14. Shimizu, H., Julius, M. A., Giarre, M., Zheng, Z., Brown, A. M. 
& Kitajewski, J. (1997) Cell Growth Differ. 8, 1349-1358. 

15. Hashimoto, Y., Shindo-Okada, N., Tani, M., Nagamachi, Y., 
Takeuchi, K., Shiroishi, T., Toma, H. & Yokota, J. (1998) J. Exp. 
Med. 187, 289-296. 

16. Zhang, R., Averboukh, L., Zhu, W., Zhang, H., Jo, H., Dempsey, 
P. J., Coffey, R. J., Pardee, A. B. & Liang, P. (1998) Mol. Cell. 
Biol. 18, 6131-6141. 

17. Grotendorst, G. R. (1997) Cytokine Growth Factor Rev. 8, 171- 
179. 

18. Kireeva, M. L., Mo, F. E., Yang, G. P. & Lau, L. F. (1996) Mol. 
Cell. Biol. 16, 1326-1334. 

19. Babic, A. M., Kireeva, M. L., Kolesnikova, T. V. & Lau, L. F. 
(1998) Proc. Natl. Acad. Sci. USA 95, 6355-6360. 

20. Martinerie, C., Huff, V., Joubert, I., Badzioch, M., Saunders, G., 
Strong, L. & Perbal, B. (1994) Oncogene 9, 2729-2732. 

21. Bork, P. (1993) FEBS Lett. 327, 125-130. 

22. Kim, H. S., Nagalla, S. R., Oh, Y., Wilson, E., Roberts, C. T., Jr. 
& Rosenfeld, R. G. (1997) Proc. Natl. Acad. Sci. USA 94, 
12981-12986. 

23. Joliot, V., Martinerie, C., Dambrine, G., Plassiart, G., Brisac, M., 
Crochet, J. & Perbal, B. (1992) Mol. Cell. Biol. 12, 10-21. 

24. Mancuso, D. J., Tuley, E. A., Westfield, L. A., Worrall, N. K., 
Shelton-Inloes, B. B., Sorace, J. M., Alevy, Y. G. & Sadler, J. E. 
(1989) J. Biol. Chem. 264, 19514-19527. 

25. Holt, G. D., Pangburn, M. K. & Ginsburg, V. (1990) J. Biol. 
Chem. 265, 2852-2855. 

26. Voorberg, J., Fontijn, R., Calafat, J., Janssen, H., van Mourik, 
J. A. & Pannekoek, H. (1991) J. Cell Biol. 113, 195-205. 

27. Martinerie, C., Viegas-Pequignot, E., Guenard, I., Dutrillaux, B., 
Nguyen, V. C., Bernheim, A. & Perbal, B. (1992) Oncogene 7, 
2529-2534. 

28. Takahashi, E., Hori, T., O'Connell, P., Leppert, M. & White, R. 
(1991) Cytogenet. Cell Genet. 57, 109-111. 

29. Meese, E., Meltzer, P. S., Witkowski, C. M. & Trent, J. M. (1989) 
Genes Chromosomes Cancer 1, 88-94. 

30. Garte, S. J. (1993) Crit. Rev. Oncog. 4, 435-449. 

31. Zhang, L., Zhou, W., Velculescu, V. E., Kern, S. E., Hruban, 
R. H., Hamilton, S. R., Vogelstein, B. & Kinzler, K. W. (1997) 
Science 276, 1268-1272. 

32. Sun, P. D. & Davies, D. R. (1995) Annu. Rev. Biophys. Biomol. 
Struct. 24, 269-291. 

33. Kireeva, M. L., Lam, S. C. T. & Lau, L. F. (1998) J. Biol. Chem. 
273, 3090-3096. 

34. Frazier, K. S. & Grotendorst, G. R. (1997) Int. J. Biochem. Cell 
Biol. 29, 153-161. 

35. Wernert, N. (1997) Virchows Arch. 430, 433-443. 

36. Tanner, M. M., Tirkkonen, M., Kallioniemi, A., Collins, C., 
Stokke, T., Karhu, R., Kowbel, D., Shadravan, F., Hintz, M., Kuo, 
W. L., et al. (1994) Cancer Res. 54, 4257-4260. 

37. Brinkmann, U., Gallo, M., Polymeropoulos, M. H. & Pastan, I. 
(1996) Genome Res. 6, 187-194. 

38. Bischoff, J. R., Anderson, L., Zhu, Y., Mossie, K., Ng, L., Souza, 
B., Schryver, B., Flanagan, P., Clairvoyant, F., Ginther, C., et al. 
(1998) EMBO J. 17, 3052-3065. 

39. Morin, P. J., Sparks, A. B., Korinek, V., Barker, N., Clevers, H., 
Vogelstein, B. & Kinzler, K. W. (1997) Science 275, 1787-1790. 

40. Lu, L. H. & Gillett, N. (1994) Cell Vision 1, 169-176. 




methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and diluted into RPMI growth medium. T-cell-proliferation assays were 
done essentially as described 2 " 1 . Briefly, after antigen pulsing (30 μg ml⁻¹ 
TTCF) with tetrapeptides (1-2 mg ml⁻¹), PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% glutaraldehyde. Glycine was added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T-cell clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with 1 μCi of 3H-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 μg TTCF with 0.25 μg 
pig kidney legumain in 500 μl 50 mM citrate buffer, pH 5.5, for 1 h at 37 °C. 
Glycopeptide digestions. The peptides HIDNEEDI, HIDN(N-glucosamine)EEDI 
and HIDNESDI, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5 mg reduced, carboxymethylated human transferrin followed by 
concanavalin A chromatography". Glycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-terminal sequencing. The lyophilized transferrin- 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mU ml⁻¹ pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of 10 mg ml⁻¹ 
α-cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal 
standardization was obtained with a matrix ion of 568.13 mass units. 
Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apo1/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expressing 
a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. 1a): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 1 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-associated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy-terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. 1b). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5, Apo2L/TRAIL 6,7, Apo3L/TWEAK 8,9, or OPGL/TRANCE/ 
RANKL 10-12 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1. Gel-filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (Kd = 0.8 ± 0.2 and 
1.1 ± 0.1 nM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 
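
To make the comparison of the two dissociation constants concrete, the sketch below evaluates receptor occupancy at a few ligand concentrations. It assumes a simple one-site (1:1) binding isotherm, which is an illustrative simplification and not the model reported for Fig. 2e; the ligand concentrations and the function name are hypothetical, and only the two Kd values come from the text.

```python
def fraction_bound(ligand_nm, kd_nm):
    """Fractional occupancy for one-site binding: L / (Kd + L)."""
    return ligand_nm / (kd_nm + ligand_nm)


KD_DCR3_FC = 0.8  # nM, value given in the text for DcR3-Fc
KD_FAS_FC = 1.1   # nM, value given in the text for Fas-Fc

# Hypothetical free soluble-FasL concentrations in nM.
for conc in (0.1, 0.8, 1.1, 10.0):
    dcr3 = fraction_bound(conc, KD_DCR3_FC)
    fas = fraction_bound(conc, KD_FAS_FC)
    print(f"{conc:5.1f} nM  DcR3-Fc {dcr3:.2f}  Fas-Fc {fas:.2f}")
```

Under this assumption each receptor is half-occupied when the ligand concentration equals its Kd, so the two curves differ only modestly across the range, in line with the "comparable affinity" described above.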

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3-Fc 
and Fas-Fc blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
~0.1 μg ml⁻¹. Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (AICD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results 13 , activation 
of interleukin-2-stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes 1,14-16. Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from ~65% to 
~30%, with half-maximal inhibition at ~1 μg ml⁻¹; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD1-4), and the N-linked glycosylation site (asterisk) are 
shown. b, Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)+ RNA (Clontech) from 
human fetal and adult tissues or cancer cell lines. PBL, peripheral blood 
lymphocyte. 
Figure 2 Interaction of DcR3 with FasL. a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area), TNFR1-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference (P < 0.001) between the 
binding of DcR3-Fc to cells transfected with FasL or pRK5. PE, phycoerythrin- 
labelled cells. b, 293 cells were transfected as in a and metabolically labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1, DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated with TNFR1-Fc, DcR3-Fc 
or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane. d, Flag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
reaction (PCR) 18 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3-based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 
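
The counts above can be checked against the statement that the gene was amplified in about half of the 35 tumours. The short Python sketch below does that arithmetic; the helper name is hypothetical and the counts are those given in the text.

```python
def percent_amplified(amplified, total):
    """Share of tumours showing amplification, as a percentage."""
    return 100.0 * amplified / total


# Counts taken from the text: (amplified, analysed) per tumour type.
counts = {"lung": (8, 18), "colon": (9, 17)}

for tissue, (amplified, total) in counts.items():
    print(f"{tissue}: {amplified}/{total} = {percent_amplified(amplified, total):.0f}%")

all_amplified = sum(a for a, _ in counts.values())
all_total = sum(t for _, t in counts.values())
print(f"combined: {all_amplified}/{all_total} = "
      f"{percent_amplified(all_amplified, all_total):.0f}%")
```

The combined figure is 17 of 35 tumours, just under one half.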

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. 1b), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 
Figure 3 Inhibition of FasL activity by DcR3. a, Human Jurkat T leukaemia cells 
were incubated with Flag-tagged soluble FasL (sFasL; 5 ng ml⁻¹) oligomerized 
with anti-Flag antibody (0.1 μg ml⁻¹) in the presence of the proposed inhibitors 
DcR3-Fc, Fas-Fc or human IgG1 and assayed for apoptosis (mean ± s.e.m. of 
triplicates). b, Jurkat cells were incubated with sFasL-Flag plus anti-Flag antibody 
as in a, in presence of 1 μg ml⁻¹ DcR3-Fc (filled circles), Fas-Fc (open circles) or 
human IgG1 (triangles), and apoptosis was determined at the indicated time 
points. c, Peripheral blood T cells were stimulated with PHA and interleukin-2, 
followed by control (white bars) or anti-CD3 antibody (filled bars), together with 
phosphate-buffered saline (PBS), human IgG1, Fas-Fc, or DcR3-Fc (10 μg ml⁻¹). 
After 16 h, apoptosis of CD4+ cells was determined (mean ± s.e.m. of results from 
five donors). d, Peripheral blood natural killer cells were incubated with 
51Cr-labelled Jurkat cells in the presence of DcR3-Fc (filled circles), Fas-Fc (open 
circles) or human IgG1 (triangles), and target-cell death was determined by 
release of 51Cr (mean ± s.d. for two donors, each in triplicate). 



we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM218xe7 (T160), which maps to 
chromosome position 20q13. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3, and sequenced the ends of the clone's insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3-linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other 
TNF-ligand-family members; however, this does not rule out the 
possibility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2,19. 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 
Figure 4 Genomic amplification of DcR3 in tumours. a, Lung cancers, comprising 
eight adenocarcinomas (c, d, f, g, h, j, k, r), seven squamous-cell carcinomas (a, e, 
m, n, o, p, q), one non-small-cell carcinoma (b), one small-cell carcinoma (i), and 
one bronchial adenocarcinoma (l). The data are means ± s.d. of 2 experiments 
done in duplicate. b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate. c, In situ hybridization 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S), blood vessel (V) and necrotic tumour tissue 
(N) are also shown. d, Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward, Rev and Fwd), the 
DcR3-linked marker T160, and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's t-test 
comparing each marker with DcR3. 
FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin-1 has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcR1 and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L 22. Unlike DcR1 and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-family member is 
OPG 3 , which shares greater sequence homology with DcR3 (31%) 
than do DcR1 (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. 



Methods 

Isolation of DcR3 cDNA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the coding regions of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgG1, expressed in insect SF9 cells or in human 293 cells, and purified as 
described". 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 μg), together with pRK5 encoding CrmA 
(2 μg) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-Fc or TNFR1-Fc and then with phycoerythrin-conjugated 
streptavidin (GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown), it is possible that DcR3 recognized some other factor that is 
expressed constitutively on 293 cells. 

Immunoprecipitation. Human 293 cells were transfected as above, and 
metabolically labelled with [35S]cysteine and [35S]methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-VAD-fmk (10 μM), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFR1-Fc 
(5 μg), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL (1 μg) (Alexis) was incubated 
with each Fc-fusion protein (1 μg), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25 μg) was 
incubated with buffer or with DcR3-Fc (40 μg) for 1.5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6-ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 μl aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgG1 before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3+ lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 μg ml⁻¹) for 24 h, and cultured 
in the presence of interleukin-2 (100 U ml⁻¹) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later by FACS analysis of annexin-V-binding of CD4+ cells 14. 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 5t Cr-loaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target-cell death was determined by release of 51 Cr in effector- target co- 
cultures relative to release of 5, Cr by detergent lysis of equal numbers of Jurkat 
ceUs. 

Gene-amplification analysis. Surgical specimens were provided by J. Kern
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted
(Qiagen) and the concentration was determined using Hoechst dye 33258
intercalation fluorometry. Amplification was determined by quantitative PCR
using a TaqMan instrument (ABI). The method was validated by comparison of
PCR and Southern hybridization data for the Myc and HER-2 oncogenes (data
not shown). Gene-specific primers and fluorogenic probes were designed on
the basis of the sequence of DcR3 or of nearby regions identified on a BAC
carrying the human DcR3 gene; alternatively, primers and probes were based
on Stanford Human Genome Center marker AFM218xe7 (T160), which is
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest
available marker which maps to ~500 kilobases from T160, and five extra
markers that span chromosome 20. The DcR3-specific primer sequences were
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the
fluorogenic probe sequence was 5'-(FAM-ACACGATGCGTGCTCCAAGCAGAAp-(TAMARA),
where FAM is 5'-fluorescein phosphoramidite. Relative
gene-copy numbers were derived using the formula 2^(ΔCT), where ΔCT is the
difference in amplification cycles required to detect DcR3 in peripheral blood
lymphocyte DNA compared to test DNA.
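
The relative gene-copy formula above can be sketched in a few lines. This is a minimal illustration, not the authors' software: the function name and the example cycle values are hypothetical, and the sketch assumes perfect doubling of product per cycle and equal DNA input in both reactions.

```python
def relative_copy_number(ct_reference: float, ct_test: float) -> float:
    """Relative gene-copy number as 2**dCT.

    dCT is the number of cycles needed to detect the gene in reference
    (peripheral blood lymphocyte) DNA minus the number needed in test DNA.
    Assumes ideal amplification efficiency (product doubles every cycle).
    """
    delta_ct = ct_reference - ct_test
    return 2.0 ** delta_ct


# Hypothetical cycle values, for illustration only (not data from the paper):
# the test DNA is detected two cycles earlier than the reference DNA.
fold = relative_copy_number(ct_reference=27.0, ct_test=25.0)
print(f"relative copy number = {fold:.1f}")
```

A test sample detected earlier than the reference gives a positive ΔCT and a value above 1; identical cycle numbers give exactly 1.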
ABC transporters (also known as traffic ATPases) form a large
family of proteins responsible for the translocation of a variety
of compounds across membranes of both prokaryotes and
eukaryotes 1. The recently completed Escherichia coli genome
sequence revealed that the largest family of paralogous E. coli
proteins is composed of ABC transporters 2. Many eukaryotic
proteins of medical significance belong to this family, such as
the cystic fibrosis transmembrane conductance regulator (CFTR),
the P-glycoprotein (or multidrug-resistance protein) and the
heterodimeric transporter associated with antigen processing
(Tap1-Tap2). Here we report the crystal structure at 1.5 Å resolution
of HisP, the ATP-binding subunit of the histidine permease,
which is an ABC transporter from Salmonella typhimurium. We
correlate the details of this structure with the biochemical, genetic
and biophysical properties of the wild-type and several mutant
HisP proteins. The structure provides a basis for understanding
properties of ABC transporters and of defective CFTR proteins.

ABC transporters contain four structural domains: two nucleotide-binding
domains (NBDs), which are highly conserved
throughout the family, and two transmembrane domains. In
prokaryotes these domains are often separate subunits which are
assembled into a membrane-bound complex; in eukaryotes the
domains are generally fused into a single polypeptide chain. The
periplasmic histidine permease of S. typhimurium and E. coli is a
well-characterized ABC transporter that is a good model for this
superfamily. It consists of a membrane-bound complex, HisQMP2,
which comprises integral membrane subunits, HisQ and HisM, and
two copies of HisP, the ATP-binding subunit. HisP, which has
properties intermediate between those of integral and peripheral
membrane proteins 9, is accessible from both sides of the membrane,
presumably by its interaction with HisQ and HisM. The two HisP
subunits form a dimer, as shown by their cooperativity in ATP
hydrolysis 5, the requirement for both subunits to be present for
activity 8, and the formation of a HisP dimer upon chemical
cross-linking. Soluble HisP also forms a dimer 3. HisP has been purified
and characterized in an active soluble form 3 which can be reconstituted
into a fully active membrane-bound complex 8.

The overall shape of the crystal structure of the HisP monomer is
that of an 'L' with two thick arms (arm I and arm II); the ATP-binding
pocket is near the end of arm I (Fig. 1). A six-stranded β-sheet
(β3 and β8-β12) spans both arms of the L, with a domain of an
α- plus β-type structure (β1, β2, β4-β7, α1 and α2) on one side
(within arm I) and a domain of mostly α-helices (α3-α9) on the




Figure 1 Crystal structure of HisP. a, View of the dimer along an axis
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested
to face towards the periplasmic and cytoplasmic sides, respectively (see text).
The thickness of arm II is about 25 Å, comparable to that of membrane. α-Helices
are shown in orange and β-sheets in green. b, View along the two-fold axis of the
HisP dimer, showing the relative displacement of the monomers not apparent in
a. The β-strands at the dimer interface are labelled. c, View of one monomer from
the bottom of arm I, as shown in a, towards arm II, showing the ATP-binding
pocket. a-c, The protein and the bound ATP are in 'ribbon' and 'ball-and-stick'
representations, respectively. Key residues discussed in the text are indicated in
c. These figures were prepared with MOLSCRIPT 29. N, amino terminus; C, C
terminus.
Gene amplification is a common event in the progression of
human cancers, and amplified oncogenes have been shown to
have diagnostic, prognostic and therapeutic relevance. A
kinetic quantitative polymerase-chain-reaction (PCR) method,
based on fluorescent TaqMan methodology and a new instrument
(ABI Prism 7700 Sequence Detection System) capable
of measuring fluorescence in real-time, was used to quantify
gene amplification in tumor DNA. Reactions are characterized
by the point during cycling when PCR amplification is still
in the exponential phase, rather than the amount of PCR
product accumulated after a fixed number of cycles. None of
the reaction components is limited during the exponential
phase, meaning that values are highly reproducible in reactions
starting with the same copy number. This greatly
improves the precision of DNA quantification. Moreover,
real-time PCR does not require post-PCR sample handling,
thereby preventing potential PCR-product carry-over contamination;
it possesses a wide dynamic range of quantification
and results in much faster and higher sample throughput.
The real-time PCR method was used to develop and validate
a simple and rapid assay for the detection and quantification
of the 3 most frequently amplified genes (myc, ccnd1 and
erbB2) in breast tumors. Extra copies of myc, ccnd1 and erbB2
were observed in 10, 23 and 15%, respectively, of 108 breast-tumor
DNA samples; the largest observed numbers of gene copies
were 4.6, 18.6 and 15.1, respectively. These results correlated
well with those of Southern blotting. The use of this new
semi-automated technique will make molecular analysis of
human cancers simpler and more reliable, and should find
broad applications in clinical and research settings. Int. J.
Cancer 78:661-666, 1998.
© 1998 Wiley-Liss, Inc.

Gene amplification plays an important role in the pathogenesis
of various solid tumors, including breast cancer, probably because
over-expression of the amplified target genes confers a selective
advantage. The first technique used to detect genomic amplification
was cytogenetic analysis. Amplification of several chromosome
regions, visualized either as extrachromosomal double minutes
(dmins) or as integrated homogeneously staining regions (HSRs),
is among the main visible cytogenetic abnormalities in breast
tumors. Other techniques such as comparative genomic hybridization
(CGH) (Kallioniemi et al., 1994) have also been used in broad
searches for regions of increased DNA copy numbers in tumor
cells, and have revealed some 20 amplified chromosome regions in
breast tumors. Positional cloning efforts are underway to identify
the critical gene(s) in each amplified region. To date, genes known
to be amplified frequently in breast cancers include myc (8q24),
ccnd1 (11q13), and erbB2 (17q12-q21) (for review, see Bieche and
Lidereau, 1995).

Amplification of the myc, ccnd1, and erbB2 proto-oncogenes
should have clinical relevance in breast cancer, since independent
studies have shown that these alterations can be used to identify
sub-populations with a worse prognosis (Berns et al., 1992;
Schuuring et al., 1992; Slamon et al., 1987). Muss et al. (1994)
suggested that these gene alterations may also be useful for the
prediction and assessment of the efficacy of adjuvant chemotherapy
and hormone therapy.

However, published results diverge both in terms of the frequency
of these alterations and their clinical value. For instance,
over 500 studies in 10 years have failed to resolve the controversy
surrounding the link suggested by Slamon et al. (1987) between
erbB2 amplification and disease progression. These discrepancies
are partly due to the clinical, histological and ethnic heterogeneity
of breast cancer, but technical considerations are also probably
involved.

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10
µg/reaction) to yield reliable quantitative results. Furthermore,
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplification
product. Interest in the kinetic method has been stimulated by a
novel approach using fluorescent TaqMan methodology and a new
instrument (ABI Prism 7700 Sequence Detection System) capable
of measuring fluorescence in real time (Gibson et al., 1996; Heid et
al., 1996). The TaqMan reaction is based on the 5' nuclease assay
first described by Holland et al. (1991). The latter uses the 5'
nuclease activity of Taq polymerase to cleave a specific fluorogenic
oligonucleotide probe during the extension phase of PCR. The
approach uses dual-labeled fluorogenic hybridization probes (Lee
et al., 1993). One fluorescent dye, covalently linked to the 5' end
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy-fluorescein)]
and its emission spectrum is quenched by a second
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine)
attached to the 3' end. During the extension phase of the PCR
cycle, the fluorescent hybridization probe is hydrolyzed by the
5'-3' nucleolytic activity of DNA polymerase. Nuclease degradation
of the probe releases the quenching of FAM fluorescence
emission, resulting in an increase in peak fluorescence emission.
The fluorescence signal is normalized by dividing the emission
intensity of the reporter dye (FAM) by the emission intensity of a
reference dye (i.e., ROX, 6-carboxy-X-rhodamine) included in
TaqMan buffer, to obtain a ratio defined as the Rn (normalized
reporter) for a given reaction tube. The use of a sequence detector
enables the fluorescence spectra of all 96 wells of the thermal
cycler to be measured continuously during PCR amplification.

The real-time PCR method offers several advantages over other
current quantitative PCR methods (Celi et al., 1994): (i) the
probe-based homogeneous assay provides a real-time method for
detecting only specific amplification products, since specific
hybridization of both the primers and the probe is necessary to generate a
signal; (ii) the Ct (threshold cycle) value used for quantification is
measured when PCR amplification is still in the log phase of PCR
product accumulation. This is the main reason why Ct is a more
reliable measure of the starting copy number than are end-point
measurements, in which a slight difference in a limiting component
can have a drastic effect on the amount of product; (iii) use of Ct
values gives a wider dynamic range (at least 5 orders of magnitude),
reducing the need for serial dilution; (iv) the real-time PCR
method is run in a closed-tube system and requires no post-PCR
sample handling, thus avoiding potential contamination; (v) the
system is highly automated, since the instrument continuously
measures fluorescence in all 96 wells of the thermal cycler during
PCR amplification and the corresponding software processes and
analyzes the fluorescence data; (vi) the assay is rapid, as results are
available just one minute after thermal cycling is complete; (vii) the
sample throughput of the method is high, since 96 reactions can be
analyzed in 2 hr.

Here, we applied this semi-automated procedure to determine
the copy numbers of the 3 most frequently amplified genes in breast
tumors (myc, ccnd1 and erbB2), as well as 2 genes (alb and app)
located in a chromosome region in which no genetic changes have
been observed in breast tumors. The results for 108 breast tumors
were compared with previous Southern-blot data for the same
samples.

MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 108 primary breast tumors removed 
surgically from patients at the Centre René Huguenin; none of the
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular-weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point
during cycling when amplification of the PCR product is first
detected, rather than by the amount of PCR product accumulated
after a fixed number of cycles. The higher the starting copy number
of the genomic DNA target, the earlier a significant increase in
fluorescence is observed. The parameter Ct (threshold cycle) is
defined as the fractional cycle number at which the fluorescence
generated by cleavage of the probe passes a fixed threshold above
baseline. The target gene copy number in unknown samples is
quantified by measuring Ct and by using a standard curve to
determine the starting copy number. The precise amount of
genomic DNA (based on optical density) and its quality (i.e., lack
of extensive degradation) are both difficult to assess. We therefore
also quantified a control gene (alb) mapping to chromosome region
4q11-q13, in which no genetic alterations have been found in
breast-tumor DNA by means of CGH (Kallioniemi et al., 1994).

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N", and is determined as follows: 

$$N = \frac{\text{copy number of target gene (app, myc, ccnd1, erbB2)}}{\text{copy number of reference gene (alb)}}$$
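
As a worked illustration of the ratio N, the sketch below averages triplicate copy numbers for the target and reference genes and divides the means. The input values are the ccnd1 and alb copy numbers reported for tumor T133 in Table II. The function name `gene_dose` is hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the text does not say which estimator was used.

```python
from statistics import mean, stdev


def gene_dose(target_copies, reference_copies):
    """N = mean target-gene copy number / mean reference-gene (alb) copy number."""
    return mean(target_copies) / mean(reference_copies)


# Triplicate copy numbers for tumor T133 as listed in Table II
ccnd1_t133 = [59821, 61659, 61821]
alb_t133 = [9787, 10092, 10533]

n_t133 = gene_dose(ccnd1_t133, alb_t133)

print(f"ccnd1: mean = {mean(ccnd1_t133):.0f}, SD = {stdev(ccnd1_t133):.0f}")
print(f"alb:   mean = {mean(alb_t133):.0f}, SD = {stdev(alb_t133):.0f}")
print(f"N(ccnd1/alb) = {n_t133:.2f}")
```

With these inputs the ratio rounds to 6.03, the value listed for T133 in Table II. Dividing by the alb copy number measured in the same DNA preparation is what makes N insensitive to the amount of DNA loaded.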

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DNAgency (Malvern, PA) and
probes from Perkin-Elmer Applied Biosystems.

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, MicroAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a
standard curve. The latter was constructed with serial dilutions of
specific PCR products, according to Piatak et al. (1993). In
practice, each specific PCR product was obtained by amplifying 20
ng of a standard human genomic DNA (Boehringer, Mannheim,
Germany) with the same primer pairs as those used later for
real-time quantitative PCR. The 5 PCR products were purified
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Sweden),
electrophoresed through an acrylamide gel and stained with
ethidium bromide to check their quality. The PCR products were
then quantified spectrophotometrically and pooled, and serially
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA)
at a constant concentration of 2 ng/µl. The standard curve used for
real-time quantitative PCR was based on serial dilutions of the pool
of PCR products ranging from 10^-7 (10^5 copies of each gene) to
10^-10 (10^2 copies). This series of diluted PCR products was
aliquoted and stored at -80°C until use.

The standard curve was validated by analyzing 2 known
quantities of calibrator human genomic DNA (20 ng and 50 ng).
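
A standard curve of this kind is usually read by fitting a straight line of Ct against the logarithm of the starting copy number and inverting it for unknown samples. The sketch below shows that calculation under stated assumptions: the Ct values are hypothetical (the paper does not list them), the ordinary least-squares fit is a generic choice rather than the instrument software's documented method, and the function names are illustrative.

```python
import math


def fit_standard_curve(copies, cts):
    """Least-squares line Ct = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the fitted line to estimate the starting copy number."""
    return 10 ** ((ct - intercept) / slope)


# Dilution series from 10^5 to 10^2 copies, as in the text.
# The Ct values are hypothetical and chosen only to illustrate the fit.
standard_copies = [1e5, 1e4, 1e3, 1e2]
standard_cts = [22.0, 25.3, 28.6, 31.9]

slope, intercept = fit_standard_curve(standard_copies, standard_cts)
unknown_copies = copies_from_ct(26.0, slope, intercept)
print(f"slope = {slope:.2f} cycles per decade, intercept = {intercept:.2f}")
print(f"estimated starting copies at Ct 26.0: {unknown_copies:.0f}")
```

A slope near -3.3 cycles per tenfold dilution corresponds to product doubling every cycle, which is why the slope is a common check on amplification efficiency.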

PCR amplification. Amplification mixes (50 µl) contained the
sample DNA (around 20 ng, around 6600 copies of disomic genes),
10X TaqMan buffer (5 µl), 200 µM dATP, dCTP, dGTP, and 400
µM dUTP, 5 mM MgCl2, 1.25 units of AmpliTaq Gold, 0.5 units of
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and
100 nM probe. The thermal cycling conditions comprised 2 min at
50°C and 10 min at 95°C. Thermal cycling consisted of 40 cycles at
95°C for 15 s and 65°C for 1 min. Each assay included: a standard
curve (from 10^5 to 10^2 copies) in duplicate, a no-template control,
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in
triplicate, and about 20 ng of unknown genomic DNA in triplicate
(26 samples can thus be analyzed on a 96-well microplate). All
samples with a coefficient of variation (CV) higher than 10% were
retested.
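
The retest rule can be expressed as a small check on each triplicate. This is a sketch under assumptions: the text does not say whether the CV was computed on copy numbers or on Ct values, nor which standard-deviation estimator was used, so the example applies the sample standard deviation to copy numbers. The function names and the replicate values are hypothetical.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """CV in percent: sample standard deviation divided by the mean."""
    return 100.0 * stdev(values) / mean(values)


def needs_retest(values, cv_limit=10.0):
    """True when the replicate CV exceeds the limit (10% in the text)."""
    return coefficient_of_variation(values) > cv_limit


# Hypothetical triplicate copy numbers, for illustration only
tight_triplicate = [4500, 4600, 4700]
loose_triplicate = [4000, 5000, 6200]

for label, reps in [("tight", tight_triplicate), ("loose", loose_triplicate)]:
    cv = coefficient_of_variation(reps)
    print(f"{label}: CV = {cv:.1f}%, retest = {needs_retest(reps)}")
```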

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a
built-in thermal cycler and a laser directed via fiber optical cables
to each of the 96 sample wells. A charge-coupled-device (CCD)
camera collects the emission from each sample and the data are
analyzed automatically. The software accompanying the 7700
system calculates Ct and determines the starting copy number in the
samples.
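
The quantities the software works with (Rn, the baseline-subtracted signal and the fractional threshold cycle) can be sketched as follows. This is not the instrument's algorithm, which the text does not describe: the baseline is taken as the mean of the first 15 cycles as stated in the Figure 1 legend, while the linear interpolation between cycles, the threshold value and all function names are assumptions made for illustration.

```python
def normalized_reporter(fam, rox):
    """Rn per cycle: reporter (FAM) emission divided by reference (ROX) emission."""
    return [f / r for f, r in zip(fam, rox)]


def delta_rn(rn, baseline_cycles=15):
    """Rn minus the baseline, taken as the mean Rn of the first cycles."""
    baseline = sum(rn[:baseline_cycles]) / baseline_cycles
    return [value - baseline for value in rn]


def threshold_cycle(drn, threshold):
    """Fractional cycle at which the signal first crosses the threshold.

    Cycles are numbered from 1, so drn[i - 1] is cycle i. Linear
    interpolation between the two flanking cycles is an assumption.
    Returns None if the threshold is never crossed (e.g. no-template control).
    """
    for i in range(1, len(drn)):
        low, high = drn[i - 1], drn[i]
        if low < threshold <= high:
            return i + (threshold - low) / (high - low)
    return None


# Synthetic trace: flat baseline for 15 cycles, then a rising signal
fam = [1.0] * 15 + [1.1, 1.3, 1.7, 2.5]
rox = [1.0] * 19
drn = delta_rn(normalized_reporter(fam, rox))
ct = threshold_cycle(drn, threshold=0.2)
print(f"Ct = {ct:.2f}")
```

Because Ct is read while the signal is still rising exponentially, a sample with more starting copies crosses the threshold at an earlier fractional cycle.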
Determination of gene amplification. Gene amplification was
calculated as described above. Only samples with an N value
higher than 2 were considered to be amplified.

RESULTS 

To validate the method, real-time PCR was performed on
genomic DNA extracted from 108 primary breast tumors, and 18
normal leukocyte DNA samples from some of the same patients.
The target genes were the myc, ccnd1 and erbB2 proto-oncogenes,
and the β-amyloid precursor protein gene (app), which maps to a
chromosome region (21q21.2) in which no genetic alterations have
been found in breast tumors (Kallioniemi et al., 1994). The
reference disomic gene was the albumin gene (alb, chromosome
4q11-q13).



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially
diluted in genomic mouse DNA at a constant concentration of
2 ng/µl. It should be noted that the 5 primer pairs chosen to analyze
the 5 target genes do not amplify genomic mouse DNA (data not
shown). Figure 1 shows the real-time PCR standard curve for the
alb gene. The dynamic range was wide (at least 4 orders of
magnitude), with samples containing as few as 10^2 copies or as
many as 10^5 copies.

Copy-number ratio of the 2 reference genes (app and alb)

The app to alb copy-number ratio was determined in 18 normal
leukocyte DNA samples and all 108 primary breast-tumor DNA
Figure 1 - Albumin (alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging
from 10^5 (A9), 10^4 (A7), 10^3 (A4) to 10^2 (A2) and a no-template control (A1). Cycle number is plotted vs. change in normalized reporter signal
(ΔRn). For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ΔRn represents the normalized reporter signal (Rn) minus the baseline
signal established in the first 15 PCR cycles. ΔRn increases during PCR as alb PCR product copy number increases until the reaction reaches a
plateau. Ct (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom:
Standard curve plotting log starting copy number vs. Ct (threshold cycle). The black dots represent the data for standard samples plotted in
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic
range.
samples. We selected these 2 genes because they are located in 2
chromosome regions (app, 21q21.2; alb, 4q11-q13) in which no
obvious genetic changes (including gains or losses) have been
observed in breast cancers (Kallioniemi et al., 1994). The ratio for
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3
(mean 1.02 ± 0.21), and was similar for the 108 primary breast-tumor
DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming
that alb and app are appropriate reference disomic genes for
breast-tumor DNA. The low range of the ratios also confirmed that
the nucleotide sequences chosen for the primers and probes were
not polymorphic, as mismatches of their primers or probes with the
subject's DNA would have resulted in differential amplification.

myc, ccnd1 and erbB2 gene dose in normal leukocyte DNA

To determine the cut-off point for gene amplification in breast-cancer
tissue, 18 normal leukocyte DNA samples were tested for
the gene dose (N), calculated as described in "Material and
Methods". The N value of these samples ranged from 0.5 to 1.3
(mean 0.84 ± 0.22) for myc, 0.7 to 1.6 (mean 1.06 ± 0.23) for
ccnd1 and 0.6 to 1.3 (mean 0.91 ± 0.19) for erbB2. Since N values
for myc, ccnd1 and erbB2 in normal leukocyte DNA consistently
fell between 0.5 and 1.6, values of 2 or more were considered to
represent gene amplification in tumor DNA.

myc, ccnd1 and erbB2 gene dose in breast-tumor DNA

myc, ccnd1 and erbB2 gene copy numbers in the 108 primary
breast tumors are reported in Table I. Extra copies of ccnd1 were
more frequent (23%, 25/108) than extra copies of erbB2 (15%,
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for
ccnd1, 2 to 15.1 for erbB2, and only 2 to 4.6 for the myc gene.
Figure 2 and Table II represent tumors in which the ccnd1 gene was
amplified 16-fold (T145), 6-fold (T133) and non-amplified (T118).
The 3 genes were never found to be co-amplified in the same tumor.
erbB2 and ccnd1 were co-amplified in only 3 cases, myc and ccnd1
in 2 cases and myc and erbB2 in 1 case. This favors the hypothesis
that gene amplifications are independent events in breast cancer.
Interestingly, 5 tumors showed a decrease of at least 50% in the
erbB2 copy number (N < 0.5), suggesting that they bore deletions
of the 17q21 region (the site of erbB2). No such decrease in copy
number was observed with the other 2 proto-oncogenes.
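
The way N values are sorted into the classes of Table I can be sketched as a simple binning function. The boundaries 0.5 and 2 come from the text; the split at 5 and the label of the top class are assumptions based on the text's reference to high copy numbers (N of 5 or more), and the function name is hypothetical.

```python
def amplification_class(n: float) -> str:
    """Assign a gene dose N to one of the Table I amplification classes."""
    if n < 0.5:
        return "<0.5"      # decrease of at least 50%, possible deletion
    if n < 2.0:
        return "0.5-1.9"   # range observed in normal leukocyte DNA
    if n < 5.0:
        return "2-4.9"     # low-level amplification
    return ">=5"           # high-level amplification (assumed top class)


# Values quoted in the text: the largest myc, ccnd1 and erbB2 doses,
# plus the N reported for the non-amplified tumor T118 in Table II
for gene, n in [("myc", 4.6), ("ccnd1", 18.6), ("erbB2", 15.1), ("T118 ccnd1", 1.06)]:
    print(f"{gene}: N = {n} -> class {amplification_class(n)}")
```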

Comparison of gene dose determined by real-time quantitative
PCR and Southern-blot analysis

Southern-blot analysis of myc, ccnd1 and erbB2 amplifications
had previously been done on the same 108 primary breast tumors. A
perfect correlation between the results of real-time PCR and
Southern blot was obtained for tumors with high copy numbers
(N ≥ 5). However, there were cases (1 myc, 6 ccnd1 and 4 erbB2)
in which real-time PCR showed gene amplification whereas
Southern-blot did not, but these were mainly cases with low extra
copy numbers (N from 2 to 2.9).

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc,
ccnd1 AND erbB2 GENES IN 108 HUMAN BREAST TUMORS

Gene      Amplification level (N)
          <0.5        0.5-1.9       2-4.9         ≥5
myc       0           97 (89.8%)    11 (10.2%)    0
ccnd1     0           83 (76.9%)    17 (15.7%)    8 (7.4%)
erbB2     5 (4.6%)    87 (80.6%)    8 (7.4%)      8 (7.4%)

reagents and requires relatively large amounts of high-quality
genomic DNA, which means it cannot be used routinely in many
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the
quantification of gene over-representation in tumors. The method,
based on real-time analysis of PCR amplification, has several
advantages over other PCR-based quantitative assays such as
competitive quantitative PCR (Celi et al., 1994). First, the real-time
PCR method is performed in a closed-tube system, avoiding the
risk of contamination by amplified products. Re-amplification of
carryover PCR products in subsequent experiments can also be
prevented by using the enzyme uracil N-glycosylase (UNG)
(Longo et al., 1990). The second advantage is the simplicity and
rapidity of sample analysis, since no post-PCR manipulations are
required. Our results show that the automated method is reliable.
We found it possible to determine, in triplicate, the number of
copies of a target gene in more than 100 tumors per day. Third, the
system has a linear dynamic range of at least 4 orders of magnitude,
meaning that samples do not have to contain equal starting amounts
of DNA. This technique should therefore be suitable for analyzing
formalin-fixed, paraffin-embedded tissues. Fourth, and above all,
real-time PCR makes DNA quantification much more precise and
reproducible, since it is based on Ct values rather than end-point
measurement of the amount of accumulated PCR product. Indeed,
the ABI Prism 7700 Sequence Detection System enables Ct to be
calculated when PCR amplification is still in the exponential phase
and when none of the reaction components is rate-limiting. The
within-run CV of the Ct value for calibrator human DNA (5
replicates) was always below 5%, and the between-assay precision
in 5 different runs was always below 10% (data not shown). In
addition, the use of a standard curve is not absolutely necessary,
since the copy number can be determined simply by comparing the
Ct ratio of the target gene with that of reference genes. The results
obtained by the 2 methods (with and without a standard curve) are
similar in our experiments (data not shown). Moreover, unlike
competitive quantitative PCR, real-time PCR does not require an
internal control (the design and storage of internal controls and the
validation of their amplification efficiency is laborious).

The only potential disadvantage of real-time PCR, like all other
PCR-based methods and solid-matrix blotting techniques (Southern
blots and dot blots), is that it cannot avoid dilution artifacts
inherent in the extraction of DNA from tumor cells contained in
heterogeneous tissue specimens. Only FISH and immunohistochemistry
can measure alterations on a cell-by-cell basis (Pauletti et al.,
1996; Slamon et al., 1989). However, FISH requires expensive
equipment and trained personnel and is also time-consuming.
Moreover, FISH does not assess gene expression and therefore
cannot detect cases in which the gene product is over-expressed in
the absence of gene amplification, which will be possible in the
future by real-time quantitative RT-PCR. Immunohistochemistry is
subject to considerable variations in the hands of different teams,
owing to alterations of target proteins during the procedure, the
different primary antibodies and fixation methods used and the
criteria used to define positive staining.

The results of this study are in agreement with those reported in
the literature. (i) Chromosome regions 4q11-q13 and 21q21.2
(which bear alb and app, respectively) showed no genetic alterations
in the breast-cancer samples studied here, in keeping with
the results of CGH (Kallioniemi et al., 1994). (ii) We found that
amplifications of these 3 oncogenes were independent events, as
reported by other teams (Berns et al., 1992; Borg et al., 1992). (iii)
The frequency and degree of myc amplification in our breast tumor
DNA series were lower than those of ccnd1 and erbB2 amplification,
confirming the findings of Borg et al. (1992) and Courjal et al.
(1997). (iv) The maxima of ccnd1 and erbB2 over-representation
were 18-fold and 15-fold, also in keeping with earlier results (about
Figure 2 - ccnd1 and alb gene dosage by real-time PCR in 3 breast tumor samples: T118 (E12, C6, black squares), T133 (G11, B4, red squares)
and T145 (A8, C8, blue squares). Given the Ct of each sample, the initial copy number is inferred from the standard curve obtained during the same
experiment. Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table II.



30-fold maximum) (Berns et ai, 1992; Borg et al., 1992; Courjal et 
al, 1997). (v) The erbB2 copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et ai, 1 995; Deng et ai, 1996; Valeron 



et ai, 1996). Our results also correlate well with those recently 
published by Gelmini et al. ( 1 997), who used the TaqMan system to 
measure erbB2 amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosystems) which only allows end- 
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- EXAMPLES OF ccndi GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS' 



Tumor 




ccndl 






alb 




Nccndl/alb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


Til 8 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


1111 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672" 


316 


16.34 




121722 






7933 









'For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gent 
amplification (NccndJlalb) is determined by dividing the average ccndi 
copy namber value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(>5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and erbB2) observed by means of real-time 
quantitative PCR as compared with Southem-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et ai, 1992; 
Slamon era/., 1987). 
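
The arithmetic described in the Figure 2 legend and the Table II footnote can be sketched in Python as follows. The triplicate copy numbers are transcribed from Table II. The log-linear form of the standard curve, its `slope` and `intercept` arguments, and all function names are assumptions made for illustration only; the excerpt does not give the curve parameters actually used.

```python
from statistics import mean, stdev


def copies_from_ct(ct, slope, intercept):
    """Invert an assumed log-linear standard curve.

    Assumes Ct = slope * log10(copies) + intercept, the usual form of a
    real-time PCR standard curve. slope and intercept are hypothetical
    inputs here; they are not reported in the excerpt.
    """
    return 10 ** ((ct - intercept) / slope)


# Triplicate copy numbers transcribed from Table II.
TABLE_II = {
    "T118": {"ccnd1": [4525, 4605, 4678], "alb": [4223, 4365, 4387]},
    "T133": {"ccnd1": [59821, 61659, 61821], "alb": [9787, 10092, 10533]},
    "T145": {"ccnd1": [128563, 125892, 121722], "alb": [7321, 7762, 7933]},
}


def gene_dose_ratio(target_copies, reference_copies):
    """Return N = mean target copy number / mean reference copy number."""
    return mean(target_copies) / mean(reference_copies)


if __name__ == "__main__":
    for tumor, data in TABLE_II.items():
        ratio = gene_dose_ratio(data["ccnd1"], data["alb"])
        print(
            f"{tumor}: ccnd1 mean={mean(data['ccnd1']):.0f} "
            f"SD={stdev(data['ccnd1']):.0f}, "
            f"alb mean={mean(data['alb']):.0f} "
            f"SD={stdev(data['alb']):.0f}, "
            f"Nccnd1/alb={ratio:.2f}"
        )
```

Applied to the Table II values, this sketch gives Nccnd1/alb ratios of 1.06, 6.03 and 16.34 for T118, T133 and T145, matching the tabulated results. The SD column is matched by the sample (n - 1) standard deviation.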

Finally, this technique can be applied to the detection of gene
deletion as well as gene amplification. Indeed, we found a
decreased copy number of erbB2 (but not of the other 2
proto-oncogenes) in several tumors; erbB2 is located in a chromosome
region (17q21) reported to contain both deletions and
amplifications in breast cancer (Bieche and Lidereau, 1995).

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
Biol. 9, 215-218 (1999).







50. Ashkenazi, A., Pai, R., Fong, S., Leung, S., Lawrence, D., Marsters, S., Blackie,
C., Chang, L., McMurtrey, A., Hebert, A., DeForge, L., Khoumenis, I., Lewis, D.,
Harris, L., Bussiere, J., Koeppen, H., Shahrokh, Z., and Schwall, R. Safety and
anti-tumor activity of recombinant soluble Apo2 ligand. J. Clin. Invest. 104,
155-162 (1999).

51. Chuntharapai, A., Gibbs, V., Lu, J., Ow, A., Marsters, S., Ashkenazi, A., De Vos,
A., Kim, K.J. Determination of residues involved in ligand binding and signal
transmission in the human IFN-α receptor 2. J. Immunol. 163, 766-773 (1999).

52. Johnsen, A.-C., Haux, J., Steinkjer, B., Nonstad, U., Egeberg, K., Sundan, A.,
Ashkenazi, A., and Espevik, T. Regulation of Apo2L/TRAIL expression in NK
cells - involvement in NK cell-mediated cytotoxicity. Cytokine 11, 664-672
(1999).

53. Roth, W., Isenmann, S., Naumann, U., Kugler, S., Bahr, M., Dichgans, J.,
Ashkenazi, A., and Weller, M. Eradication of intracranial human malignant
glioma xenografts by Apo2L/TRAIL. Biochem. Biophys. Res. Commun. 265,
479-483 (1999).

54. Hymowitz, S.G., Christinger, H.W., Fuh, G., Ultsch, M., O'Connell, M., Kelley,
R.F., Ashkenazi, A., and de Vos, A.M. Triggering Cell Death: The Crystal
Structure of Apo2L/TRAIL in a Complex with Death Receptor 5. Molec. Cell 4,
563-571 (1999).

55. Hymowitz, S.G., O'Connell, M.P., Ultsch, M.H., Hurst, A., Totpal, K., Ashkenazi,
A., de Vos, A.M., Kelley, R.F. A unique zinc-binding site revealed by a
high-resolution X-ray structure of homotrimeric Apo2L/TRAIL. Biochemistry 39,
633-640 (2000).

56. Zhou, Q., Fukushima, P., DeGraff, W., Mitchell, J.B., Stetler-Stevenson, M.,
Ashkenazi, A., and Steeg, P.S. Radiation and the Apo2L/TRAIL apoptotic
pathway preferentially inhibit the colonization of premalignant human breast
cancer cells overexpressing cyclin D1. Cancer Res. 60, 2611-2615 (2000).

57. Kischkel, F.C., Lawrence, D.A., Chuntharapai, A., Schow, P., Kim, J., and
Ashkenazi, A. Apo2L/TRAIL-dependent recruitment of endogenous FADD and
Caspase-8 to death receptors 4 and 5. Immunity 12, 611-620 (2000).

58. Yan, M., Marsters, S.A., Grewal, I.S., Wang, H., *Ashkenazi, A., and *Dixit,
V.M. Identification of a receptor for BLyS demonstrates a crucial role in humoral
immunity. Nature Immunol. 1, 37-41 (2000).
59. Marsters, S.A., Yan, M., Pitti, R.M., Haas, P.E., Dixit, V.M., and Ashkenazi, A.
Interaction of the TNF homologues BLyS and APRIL with the TNF receptor
homologues BCMA and TACI. Curr. Biol. 10, 785-788 (2000).

60. Kischkel, F.C., and Ashkenazi, A. Combining enhanced metabolic labeling with
immunoblotting to detect interactions of endogenous cellular proteins.
Biotechniques 29, 506-512 (2000).

61. Lawrence, D., Shahrokh, Z., Marsters, S., Achilles, K., Shih, D., Mounho, B.,
Hillan, K., Totpal, K., DeForge, L., Schow, P., Hooley, J., Sherwood, S., Pai, R.,
Leung, S., Khan, L., Gliniak, B., Bussiere, J., Smith, C., Strom, S., Kelley, S.,
Fox, J., Thomas, D., and Ashkenazi, A. Differential hepatocyte toxicity of
recombinant Apo2L/TRAIL versions. Nature Med. 7, 383-385 (2001).

62. Chuntharapai, A., Dodge, K., Grimmer, K., Schroeder, K., Marsters, S.A.,
Koeppen, H., Ashkenazi, A., and Kim, K.J. Isotype-dependent inhibition of
tumor growth in vivo by monoclonal antibodies to death receptor 4. J. Immunol.
166, 4891-4898 (2001).

63. Pollack, I.F., Erff, M., and Ashkenazi, A. Direct stimulation of apoptotic
signaling by soluble Apo2L/tumor necrosis factor-related apoptosis-inducing
ligand leads to selective killing of glioma cells. Clin. Cancer Res. 7, 1362-1369
(2001).

64. Wang, H., Marsters, S.A., Baker, T., Chan, B., Lee, W.P., Fu, L., Tumas, D., Yan,
M., Dixit, V.M., *Ashkenazi, A., and *Grewal, I.S. TACI-ligand interactions are
required for T cell activation and collagen-induced arthritis in mice. Nature
Immunol. 2, 632-637 (2001).

65. Kischkel, F.C., Lawrence, D.A., Tinel, A., Virmani, A., Schow, P., Gazdar, A.,
Blenis, J., Arnott, D., and Ashkenazi, A. Death receptor recruitment of
endogenous caspase-10 and apoptosis initiation in the absence of caspase-8. J.
Biol. Chem. 276, 46639-46646 (2001).

66. LeBlanc, H., Lawrence, D.A., Varfolomeev, E., Totpal, K., Morlan, J., Schow, P.,
Fong, S., Schwall, R., Sinicropi, D., and Ashkenazi, A. Tumor cell resistance to
death receptor induced apoptosis through mutational inactivation of the
proapoptotic Bcl-2 homolog Bax. Nature Med. 8, 274-281 (2002).

67. Miller, K., Meng, G., Liu, J., Hurst, A., Hsei, V., Wong, W-L., Ekert, R.,
Lawrence, D., Sherwood, S., DeForge, L., Gaudreault., Keller, G., Sliwkowski,
M., Ashkenazi, A., and Presta, L. Design, Construction, and analyses of
multivalent antibodies. J. Immunol. 170, 4854-4861 (2003).
68. Varfolomeev, E., Kischkel, F., Martin, F., Wanh, H., Lawrence, D., Olsson, C.,
Tom, L., Erickson, S., French, D., Schow, P., Grewal, I. and Ashkenazi, A.
Immune system development in APRIL knockout mice. Submitted.

Review articles:

1. Ashkenazi, A., Peralta, E., Winslow, J., Ramachandran, J., and Capon, D.J.
Functional role of muscarinic acetylcholine receptor subtype diversity. Cold
Spring Harbor Symposium on Quantitative Biology. LIII, 263-272 (1988).

2. Ashkenazi, A., Peralta, E., Winslow, J., Ramachandran, J., and Capon, D.
Functional diversity of muscarinic receptor subtypes in cellular signal
transduction and growth. Trends Pharmacol. Sci. Dec Supplement, 12-21 (1989).

3. Chamow, S., Duliege, A., Ammann, A., Kahn, J., Allen, D., Eichberg, J., Byrn,
R., Capon, D., Ward, R., and Ashkenazi, A. CD4 immunoadhesins in anti-HIV
therapy: new developments. Int. J. Cancer Supplement 7, 69-72 (1992).

4. Ashkenazi, A., Capon, D., and Ward, R. Immunoadhesins. Int. Rev. Immunol. 10,
217-225 (1993).

5. Ashkenazi, A., and Peralta, E. Muscarinic Receptors. In Handbook of Receptors
and Channels. (S. Peroutka, ed.), CRC Press, Boca Raton, Vol. I, p. 1-27 (1994).

6. Krantz, S. B., Means, R. T., Jr., Lina, J., Marsters, S. A., and Ashkenazi, A.
Inhibition of erythroid colony formation in vitro by gamma interferon. In
Molecular Biology of Hematopoiesis (N. Abraham, R. Shadduck, A. Levine, F.
Takaku, eds.) Intercept Ltd. Paris, Vol. 3, p. 135-147 (1994).

7. Ashkenazi, A. Cytokine neutralization as a potential therapeutic approach for
SIRS and shock. J. Biotechnology in Healthcare 1, 197-206 (1994).

8. Ashkenazi, A., and Chamow, S. M. Immunoadhesins: an alternative to human
monoclonal antibodies. Immunomethods: A companion to Methods in
Enzymology 8, 104-115 (1995).

9. Chamow, S., and Ashkenazi, A. Immunoadhesins: Principles and Applications.
Trends Biotech. 14, 52-60 (1996).

10. Ashkenazi, A., and Chamow, S. M. Immunoadhesins as research tools and
therapeutic agents. Curr. Opin. Immunol. 9, 195-200 (1997).

11. Ashkenazi, A., and Dixit, V. Death receptors: signaling and modulation. Science
281, 1305-1308 (1998).

12. Ashkenazi, A., and Dixit, V. Apoptosis control by death and decoy receptors.
Curr. Opin. Cell Biol. 11, 255-260 (1999).
13. Ashkenazi, A. Chapters on Apo2L/TRAIL; DR4, DR5, DcR1, DcR2; and DcR3.
Online Cytokine Handbook (www.apnet.com/cytokinereference/).

14. Ashkenazi, A. Targeting death and decoy receptors of the tumor necrosis factor
superfamily. Nature Rev. Cancer 2, 420-430 (2002).

15. LeBlanc, H. and Ashkenazi, A. Apoptosis signaling by Apo2L/TRAIL. Cell Death
and Differentiation 10, 66-75 (2003).

16. Almasan, A. and Ashkenazi, A. Apo2L/TRAIL: apoptosis signaling, biology, and
potential for cancer therapy. Cytokine and Growth Factor Reviews 14, 337-348
(2003).

Book:

Antibody Fusion Proteins (Chamow, S., and Ashkenazi, A., eds., John Wiley and
Sons Inc.) (1999).

Talks: 

1. Resistance of primary HIV isolates to CD4 is independent of CD4-gp120 binding
affinity. UCSD Symposium, HIV Disease: Pathogenesis and Therapy.
Greenelefe, FL, March 1991.

2. Use of immuno-hybrids to extend the half-life of receptors. IBC conference on
Biopharmaceutical Halflife Extension. New Orleans, LA, June 1992.

3. Results with TNF receptor Immunoadhesins for the Treatment of Sepsis. IBC
conference on Endotoxemia and Sepsis. Philadelphia, PA, June 1992.

4. Immunoadhesins: an alternative to human antibodies. IBC conference on
Antibody Engineering. San Diego, CA, December 1993.

5. Tumor necrosis factor receptor: a potential therapeutic for human septic shock.
American Society for Microbiology Meeting, Atlanta, GA, May 1993.

6. Protective efficacy of TNF receptor immunoadhesin vs anti-TNF monoclonal
antibody in a rat model for endotoxic shock. 5th International Congress on TNF.
Asilomar, CA, May 1994.

7. Interferon-γ signals via a multisubunit receptor complex that contains two types of
polypeptide chain. American Association of Immunologists Conference. San
Francisco, CA, July 1995.

8. Immunoadhesins: Principles and Applications. Gordon Research Conference on
Drug Delivery in Biology and Medicine. Ventura, CA, February 1996.
9. Apo-2 Ligand, a new member of the TNF family that induces apoptosis in tumor
cells. Cambridge Symposium on TNF and Related Cytokines in Treatment of
Cancer. Hilton-Head, NC, March 1996.

10. Induction of apoptosis by Apo2 Ligand. American Society for Biochemistry and
Molecular Biology, Symposium on Growth Factors and Cytokine Receptors. New
Orleans, LA, June 1996.

11. Apo2 ligand, an extracellular trigger of apoptosis. 2nd Clontech Symposium,
Palo Alto, CA, October 1996.

12. Regulation of apoptosis by members of the TNF ligand and receptor families.
Stanford University School of Medicine, Palo Alto, CA, December 1996.

13. Apo-3: a novel receptor that regulates cell death and inflammation. 4th
International Congress on Immune Consequences of Trauma, Shock, and Sepsis.
Munich, Germany, March 1997.

14. New members of the TNF ligand and receptor families that regulate apoptosis,
inflammation, and immunity. UCLA School of Medicine, LA, CA, March 1997.

15. Immunoadhesins: an alternative to monoclonal antibodies. 5th World Conference
on Bispecific Antibodies. Volendam, Holland, June 1997.

16. Control of Apo2L signaling. Cold Spring Harbor Laboratory Symposium on
Programmed Cell Death. Cold Spring Harbor, New York. September 1997.

17. Chairman and speaker, Apoptosis Signaling session. IBC's 4th Annual
Conference on Apoptosis. San Diego, CA, October 1997.

18. Control of Apo2L signaling by death and decoy receptors. American Association
for the Advancement of Science. Philadelphia, PA, February 1998.

19. Apo2 ligand and its receptors. American Society of Immunologists. San
Francisco, CA, April 1998.

20. Death receptors and ligands. 7th International TNF Congress. Cape Cod, MA,
May 1998.

21. Apo2L as a potential therapeutic for cancer. UCLA School of Medicine. LA,
CA, June 1998.

22. Apo2L as a potential therapeutic for cancer. Gordon Research Conference on
Cancer Chemotherapy. New London, NH, July 1998.

23. Control of apoptosis by Apo2L. Endocrine Society Conference, Stevenson, WA,
August 1998.

24. Control of apoptosis by Apo2L. International Cytokine Society Conference,
Jerusalem, Israel, October 1998.
25. Apoptosis control by death and decoy receptors. American Association for
Cancer Research Conference, Whistler, BC, Canada, March 1999.

26. Apoptosis control by death and decoy receptors. American Society for
Biochemistry and Molecular Biology Conference, San Francisco, CA, May 1999.

27. Apoptosis control by death and decoy receptors. Gordon Research Conference on
Apoptosis, New London, NH, June 1999.

28. Apoptosis control by death and decoy receptors. Arthritis Foundation Research
Conference, Alexandria GA, Aug 1999.

29. Safety and anti-tumor activity of recombinant soluble Apo2L/TRAIL. Cold
Spring Harbor Laboratory Symposium on Programmed Cell Death. Cold Spring
Harbor, NY, September 1999.

30. The Apo2L/TRAIL system: therapeutic potential. American Association for
Cancer Research, Lake Tahoe, NV, Feb 2000.

31. Apoptosis and cancer therapy. Stanford University School of Medicine, Stanford,
CA, Mar 2000.

32. Apoptosis and cancer therapy. University of Pennsylvania School of Medicine,
Philadelphia, PA, Apr 2000.

33. Apoptosis signaling by Apo2L/TRAIL. International Congress on TNF.
Trondheim, Norway, May 2000.

34. The Apo2L/TRAIL system: therapeutic potential. Cap-CURE summit meeting.
Santa Monica, CA, June 2000.

35. The Apo2L/TRAIL system: therapeutic potential. MD Anderson Cancer Center.
Houston, TX, June 2000.

36. Apoptosis signaling by Apo2L/TRAIL. The Protein Society, 14th Symposium.
San Diego, CA, August 2000.

37. Anti-tumor activity of Apo2L/TRAIL. AAPS annual meeting. Indianapolis, IN,
Aug 2000.

38. Apoptosis signaling and anti-cancer potential of Apo2L/TRAIL. Cancer Research
Institute, UC San Francisco, CA, September 2000.

39. Apoptosis signaling by Apo2L/TRAIL. Keynote address, TNF family
Minisymposium, NIH. Bethesda, MD, September 2000.

40. Death receptors: signaling and modulation. Keystone symposium on the
Molecular basis of cancer. Taos, NM, Jan 2001.

41. Preclinical studies of Apo2L/TRAIL in cancer. Symposium on Targeted therapies
in the treatment of lung cancer. Aspen, CO, Jan 2001.
42. Apoptosis signaling by Apo2L/TRAIL. Weizmann Institute of Science, Rehovot,
Israel, March 2001.

43. Apo2L/TRAIL: Apoptosis signaling and potential for cancer therapy. Weizmann
Institute of Science, Rehovot, Israel, March 2001.

44. Targeting death receptors in cancer with Apo2L/TRAIL. Cell Death and Disease
conference, North Falmouth, MA, Jun 2001.

45. Targeting death receptors in cancer with Apo2L/TRAIL. Biotechnology
Organization conference, San Diego, CA, Jun 2001.

46. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Gordon Research
Conference on Apoptosis, Oxford, UK, July 2001.

47. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Cleveland Clinic
Foundation, Cleveland, OH, Oct 2001.

48. Apoptosis signaling by death receptors: overview. International Society for
Interferon and Cytokine Research conference, Cleveland, OH, Oct 2001.

49. Apoptosis signaling by death receptors. American Society of Nephrology
Conference. San Francisco, CA, Oct 2001.

50. Targeting death receptors in cancer. Apoptosis: commercial opportunities. San
Diego, CA, Apr 2002.

51. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Kimmel Cancer
Research Center, Johns Hopkins University, Baltimore MD. May 2002.

52. Apoptosis control by Apo2L/TRAIL. (Keynote Address) University of Alabama
Cancer Center Retreat, Birmingham, AL. October 2002.

53. Apoptosis signaling by Apo2L/TRAIL. (Session co-chair) TNF international
conference. San Diego, CA. October 2002.

54. Apoptosis signaling by Apo2L/TRAIL. Swiss Institute for Cancer Research
(ISREC). Lausanne, Switzerland. Jan 2003.

55. Apoptosis induction with Apo2L/TRAIL. Conference on New Targets and
Innovative Strategies in Cancer Treatment. Monte Carlo. February 2003.

56. Apoptosis signaling by Apo2L/TRAIL. Hermelin Brain Tumor Center
Symposium on Apoptosis. Detroit, MI. April 2003.

57. Targeting apoptosis through death receptors. Sixth Annual Conference on
Targeted Therapies in the Treatment of Breast Cancer. Kona, Hawaii. July 2003.

58. Targeting apoptosis through death receptors. Second International Conference on
Targeted Cancer Therapy. Washington, DC. Aug 2003.

Issued Patents:







1. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,329,028 (Jul 12, 1994). 

2. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,605,791 (Feb 25, 1997). 

3. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,889,155 (Jul 27, 1999).

4. Ashkenazi, A., APO-2 Ligand. US patent 6,030,945 (Feb 29, 2000).

5. Ashkenazi, A., Chuntharapai, A., Kim, J., APO-2 ligand antibodies. US patent
6,046,048 (Apr 4, 2000).

6. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking
reagents. US patent 6,124,435 (Sep 26, 2000).

7. Ashkenazi, A., Chuntharapai, A., Kim, J., Method for making monoclonal and
cross-reactive antibodies. US patent 6,252,050 (Jun 26, 2001).

8. Ashkenazi, A. APO-2 Receptor. US patent 6,342,369 (Jan 29, 2002).

9. Ashkenazi, A., Fong, S., Goddard, A., Gurney, A., Napier, M., Tumas, D., Wood, W.
A-33 polypeptides. US patent 6,410,708 (Jun 25, 2002).

10. Ashkenazi, A. APO-3 Receptor. US patent 6,462,176 B1 (Oct 8, 2002).

11. Ashkenazi, A. APO-2LI and APO-3 polypeptide antibodies. US patent 6,469,144 B1
(Oct 22, 2002).

12. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,582,928B1 (Jun 24, 2003). 
DECLARATION OF PAUL POLAKIS, Ph.D.
I, Paul Polakis, Ph.D., declare and say as follows:

1. I was awarded a Ph.D. by the Department of Biochemistry of the Michigan
State University in 1984. My scientific Curriculum Vitae is attached to and forms
part of this Declaration (Exhibit A).

2. I am currently employed by Genentech, Inc. where my job title is Staff
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has
been leading Genentech's Tumor Antigen Project, which is a large research project
with a primary focus on identifying tumor cell markers that find use as targets for
both the diagnosis and treatment of cancer in humans.

3. As part of the Tumor Antigen Project, my laboratory has been analyzing
differential expression of various genes in tumor cells relative to normal cells.
The purpose of this research is to identify proteins that are abundantly expressed
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at
lower levels, on corresponding normal cells. We call such differentially expressed
proteins "tumor antigen proteins". When such a tumor antigen protein is
identified, one can produce an antibody that recognizes and binds to that protein.
Such an antibody finds use in the diagnosis of human cancer and may ultimately
serve as an effective therapeutic in the treatment of human cancer.

4. In the course of the research conducted by Genentech's Tumor Antigen
Project, we have employed a variety of scientific techniques for detecting and
studying differential gene expression in human tumor cells relative to normal cells,
at genomic DNA, mRNA and protein levels. An important example of one such
technique is the well known and widely used technique of microarray analysis
which has proven to be extremely useful for the identification of mRNA molecules
that are differentially expressed in one tissue or cell type relative to another. In the
course of our research using microarray analysis, we have identified
approximately 200 gene transcripts that are present in human tumor cells at
significantly higher levels than in corresponding normal human cells. To date, we
have generated antibodies that bind to about 30 of the tumor antigen proteins
expressed from these differentially expressed gene transcripts and have used these
antibodies to quantitatively determine the level of production of these tumor
antigen proteins in both human cancer cells and corresponding normal cells. We
have then compared the levels of mRNA and protein in both the tumor and normal
cells analyzed.

5. From the mRNA and protein expression analyses described in paragraph 4
above, we have observed that there is a strong correlation between changes in the
level of mRNA present in any particular cell type and the level of protein
expressed from that mRNA in that cell type. In approximately 80% of our
observations we have found that increases in the level of a particular mRNA
correlates with changes in the level of protein expressed from that mRNA when
human tumor cells are compared with their corresponding normal cells.

6. Based upon my own experience accumulated in more than 20 years of
research, including the data discussed in paragraphs 4 and 5 above and my
knowledge of the relevant scientific literature, it is my considered scientific
opinion that for human genes, an increased level of mRNA in a tumor cell relative
to a normal cell typically correlates to a similar increase in abundance of the
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a
central dogma in molecular biology that increased mRNA levels are predictive of
corresponding increased levels of the encoded protein. While there have been
published reports of genes for which such a correlation does not exist, it is my
opinion that such reports are exceptions to the commonly understood general rule
that increased mRNA levels are predictive of corresponding increased levels of the
encoded protein.

7. I hereby declare that all statements made herein of my own knowledge are
true and that all statements made on information or belief are believed to be true,
and further that these statements were made with the knowledge that willful false
statements and the like so made are punishable by fine or imprisonment, or both,
under Section 1001 of Title 18 of the United States Code and that such willful
statements may jeopardize the validity of the application or any patent issued
thereon.



Paul Polakis, Ph.D.
PUBLICATIONS:

1. Polakis, P. G. and Wilson, J. E. 1982 Purification of a Highly Bindable Rat Brain
Hexokinase by High Performance Liquid Chromatography. Biochem. Biophys.
Res. Commun. 107, 937-943.

2. Polakis, P. G. and Wilson, J. E. 1984 Proteolytic Dissection of Rat Brain
Hexokinase: Determination of the Cleavage Pattern during Limited Digestion with
Trypsin. Arch. Biochem. Biophys. 234, 341-352.

3. Polakis, P. G. and Wilson, J. E. 1985 An Intact Hydrophobic N-Terminal
Sequence is Required for the Binding Rat Brain Hexokinase to Mitochondria. Arch.
Biochem. Biophys. 236, 328-337.

4. Uhing, R. J., Polakis, P. G. and Snyderman, R. 1987 Isolation of GTP-binding
Proteins from Myeloid HL60 Cells. J. Biol. Chem. 262, 15575-15579.

5. Polakis, P. G., Uhing, R. J. and Snyderman, R. 1988 The Formylpeptide
Chemoattractant Receptor Copurifies with a GTP-binding Protein Containing a
Distinct 40 kDa Pertussis Toxin Substrate. J. Biol. Chem. 263, 4969-4979.

6. Uhing, R. J., Dillon, S., Polakis, P. G., Truett, A. P. and Snyderman, R. 1988
Chemoattractant Receptors and Signal Transduction Processes in Cellular and
Molecular Aspects of Inflammation (Poste, G. and Crooke, S. T. eds.) pp 335-379.

7. Polakis, P. G., Evans, T. and Snyderman, R. 1989 Multiple Chromatographic Forms
of the Formylpeptide Chemoattractant Receptor and their Relationship to
GTP-binding Proteins. Biochem. Biophys. Res. Commun. 161, 276-283.

8. Polakis, P. G., Snyderman, R. and Evans, T. 1989 Characterization of G25K, a
GTP-binding Protein Containing a Novel Putative Nucleotide Binding Domain.
Biochem. Biophys. Res. Commun. 160, 25-32.

9. Polakis, P., Weber, R. F., Nevins, B., Didsbury, J., Evans, T. and Snyderman, R.
1989 Identification of the ral and rac1 Gene Products, Low Molecular Mass
GTP-binding Proteins from Human Platelets. J. Biol. Chem. 264, 16383-16389.

10. Snyderman, R., Perianin, A., Evans, T., Polakis, P. and Didsbury, J. 1989 G
Proteins and Neutrophil Function. In ADP-Ribosylating Toxins and G Proteins:
Insights into Signal Transduction. (J. Moss and M. Vaughn, eds.) Amer. Soc.
Microbiol., pp. 295-323.

11. Hart, M. J., Polakis, P., Evans, T. and Cerrione, R. A. 1990 Identification
and Characterization of an Epidermal Growth Factor-Stimulated Phosphorylation of a
Specific Low Molecular Mass GTP-binding Protein in a Reconstituted Phospholipid
Vesicle System. J. Biol. Chem. 265, 5990-6001.

12. Yatani, A., Okabe, K., Polakis, P., Halenbeck, R., McCormick, F. and Brown, A.
M. 1990 ras p21 and GAP Inhibit Coupling of Muscarinic Receptors to Atrial K+
Channels. Cell 61, 769-776.

13. Munemitsu, S., Innis, M. A., Clark, R., McCormick, F., Ullrich, A. and Polakis,
P. G. 1990 Molecular Cloning and Expression of a G25K cDNA, the Human Homolog
of the Yeast Cell Cycle Gene CDC42. Mol. Cell. Biol. 10, 5977-5982.

14. Polakis, P. G., Rubinfeld, B., Evans, T. and McCormick, F. 1991 Purification of
Plasma Membrane-Associated GTPase Activating Protein (GAP) Specific for rap-
Genome-wide Study of Gene Copy Numbers,
Transcripts, and Protein Levels in Pairs of
Non-invasive and Invasive Human Transitional
Cell Carcinomas*

Torben F. Ørntoft‡§, Thomas Thykjaer¶, Frederic M. Waldman||, Hans Wolf**,
and Julio E. Celis‡‡



Gain and loss of chromosomal material is characteristic
of bladder cancer, as well as malignant transformation in
general. The consequences of these changes at both the
transcription and translation levels is at present unknown
partly because of technical limitations. Here we have
attempted to address this question in pairs of non-invasive
and invasive human bladder tumors using a combination
of technology that included comparative genomic
hybridization, high density oligonucleotide array-based
monitoring of transcript levels (5600 genes), and high resolution
two-dimensional gel electrophoresis. The results showed
that there is a gene dosage effect that in some cases
superimposes on other regulatory mechanisms. This
effect depended (p < 0.015) on the magnitude of the
comparative genomic hybridization change. In general (18 of
23 cases), chromosomal areas with more than 2-fold gain
of DNA showed a corresponding increase in mRNA
transcripts. Areas with loss of DNA, on the other hand,
showed either reduced or unaltered transcript levels.
Because most proteins resolved by two-dimensional gels
are unknown it was only possible to compare mRNA and
protein alterations in relatively few cases of well focused
abundant proteins. With few exceptions we found a good
correlation (p < 0.005) between transcript alterations and
protein levels. The implications, as well as limitations,
of the approach are discussed. Molecular & Cellular
Proteomics 1:37-45, 2002.

Aneuploidy is a common feature of most human cancers
(1), but little is known about the genome-wide effect of this
phenomenon at both the transcription and translation levels.
High throughput array studies of the breast cancer cell line
BT474 has suggested that there is a correlation between
DNA copy numbers and gene expression in highly amplified
areas (2), and studies of individual genes in solid tumors
have revealed a good correlation between gene dose and
mRNA or protein levels in the case of c-erb-B2, cyclin D1,
ems1, and N-myc (3-5). However, a high cyclin D1 protein
expression has been observed without simultaneous
amplification (4), and a low level of c-myc copy number
increase was observed without concomitant c-myc protein
overexpression (6).

In human bladder tumors, karyotyping, fluorescent in situ
hybridization, and comparative genomic hybridization (CGH) 1
have revealed chromosomal aberrations that seem to be
characteristic of certain stages of disease progression. In the
case of non-invasive pTa transitional cell carcinomas (TCCs),
this includes loss of chromosome 9 or parts of it, as well as
loss of Y in males. In minimally invasive pT1 TCCs, the
following alterations have been reported: 2q-, 11p-, 1q+,
11q13+, 17q+, and 20q+ (7-12). It has been suggested that
these regions harbor tumor suppressor genes and
oncogenes; however, the large chromosomal areas involved often
contain many genes, making meaningful predictions of the
functional consequences of losses and gains very difficult.

In this investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) with 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 

1 The abbreviations used are: CGH, comparative genomic hybridization;
TCC, transitional cell carcinoma; LOH, loss of heterozygosity;
PA-FABP, psoriasis-associated fatty acid-binding protein; 2D,
two-dimensional.
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Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chr.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as
compared with the non-invasive counterpart tumor 335. B, expression of mRNA in invasive tumor 827 compared with the non-invasive
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length of the chromosome
(left). The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves indicating one standard
deviation. The central vertical line (broken) indicates a ratio value of 1 (no change), and the vertical lines next to it (dotted) indicate a ratio of
0.5 (left) and 2.0 (right). In chromosomes where the non-invasive tumor 335 used for comparison showed alterations in DNA content, the ratio
profile of that chromosome is shown to the right of the invasive tumor profile. The colored bars represent one gene each, identified by the
running numbers above the bars (the name of the gene can be seen at www.MDL.DK/sdata.html). The bars indicate the purported location of
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart: >2-fold
increase (black), >2-fold decrease (blue), no significant change (orange). The bar to the far right, entitled Expression, shows the resulting change
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated (black), at least half of the genes
down-regulated (blue), or more than half of the genes are unchanged (orange). If a gene was absent in one of the samples and present in
another, it was regarded as more than a 2-fold change. A 2-fold level was chosen as this corresponded to one standard deviation in a double
determination of ~1800 genes. Centromeres and heterochromatic regions were excluded from data analysis.



grade I and II, respectively; tumors 733 and 827 were staged as pT1
(invasive into the submucosa); 733 was staged as solid and 827 as
papillary, both grade III.

mRNA Preparation— Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodium-guanidinium thiocyanate
solution and stored at -80 °C. Total RNA was isolated using the 
RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A) + RNA was isolated by an oligo(dT) selection step (Oligotex 
mRNA kit; Qiagen). 

cRNA Preparation— 1 µg of mRNA was used as starting material.
The first and second strand cDNA synthesis was performed using the
Superscript® choice system (Invitrogen) according to the manufacturer's
instructions but using an oligo(dT) primer containing a T7 RNA
polymerase binding site. Labeled cRNA was prepared using the
MEGAscript® in vitro transcription kit (Ambion). Biotin-labeled CTP and
UTP (Enzo) were used, together with unlabeled NTPs in the reaction.
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning— Array hybridization and scanning
was modified from a previous method (13). 10 µg of cRNA was
fragmented at 94 °C for 35 min in buffer containing 40 mM Tris
acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization,
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 M NaCl,
10 mM Tris, pH 7.6, 0.005% Triton) was heated to 95 °C for 5 min,
subsequently cooled to 40 °C, and loaded onto the Affymetrix probe
array cartridge. The probe array was then incubated for 16 h at 40 °C
at constant rotation (60 rpm). The probe array was exposed to 10
washes in 6x SSPE-T at 25 °C followed by 4 washes in 0.5x SSPE-T
at 50 °C. The biotinylated cRNA was stained with a streptavidin-phycoerythrin
conjugate, 10 µg/ml (Molecular Probes) in 6x SSPE-T
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Fig. 1— continued 



for 30 min at 25 °C followed by 10 washes in 6x SSPE-T at 25 °C. The
probe arrays were scanned at 560 nm using a confocal laser scanning
microscope (made for Affymetrix by Hewlett-Packard). The readings
from the quantitative scanning were analyzed by Affymetrix gene 
expression analysis software. 

Microsatellite Analysis— Microsatellite analysis was performed as
described previously (14). Microsatellites were selected by use of
www.ncbi.nlm.nih.gov/genemap98, and primer sequences were obtained
from the genome data base at www.gdb.org. DNA was extracted
from tumor and blood and amplified by PCR in a volume of 20 µl for 35
cycles. The amplicons were denatured and electrophoresed for 3 h in an
ABI Prism 377. Data were collected in the Gene Scan program for
fragment analysis. Loss of heterozygosity was defined as less than 33%
of one allele detected in tumor amplicons compared with blood.
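
The 33% criterion for loss of heterozygosity can be illustrated with a minimal sketch. The text does not say how the allele signals were normalised, so the code below assumes that each allele's peak height is compared with the other allele in the same sample and then with the same ratio in blood; the function names and the example peak heights are hypothetical and are not taken from the study.

```python
def retained_fraction(normal, tumor, lost):
    """Fraction of allele `lost` (0 or 1) retained in tumor relative to blood.

    `normal` and `tumor` are (allele_1, allele_2) peak heights.
    Assumption: the other allele serves as the internal reference.
    """
    kept = 1 - lost
    if normal[0] <= 0 or normal[1] <= 0:
        raise ValueError("marker must be heterozygous in normal DNA")
    if tumor[kept] <= 0:
        raise ValueError("reference allele has no signal in tumor DNA")
    return (tumor[lost] / tumor[kept]) / (normal[lost] / normal[kept])


def has_loh(normal, tumor, threshold=0.33):
    """Call LOH when either allele falls below `threshold` of its normal level."""
    for lost in (0, 1):
        if tumor[1 - lost] > 0 and retained_fraction(normal, tumor, lost) < threshold:
            return True
    return False


# Hypothetical peak heights (arbitrary units), for illustration only.
blood = (1000.0, 900.0)
print(has_loh(blood, (950.0, 200.0)))  # second allele strongly reduced
print(has_loh(blood, (800.0, 700.0)))  # both alleles retained
```

Using the second allele as an internal reference makes the call independent of the total amount of amplicon loaded, which is one common reason for normalising this way.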

Proteomic Analysis— TCCs were minced into small pieces and
homogenized in a small glass homogenizer in 0.5 ml of lysis solution.
Samples were stored at -20 °C until use. The procedure for 2D gel
electrophoresis has been described in detail elsewhere (15, 16). Gels
were stained with silver nitrate and/or Coomassie Brilliant Blue. Proteins
were identified by a combination of procedures that included
microsequencing, mass spectrometry, two-dimensional gel Western
immunoblotting, and comparison with the master two-dimensional gel
image of human keratinocyte proteins; see biobase.dk/cgi-bin/celis.

CGH— Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein-labeled tumor DNA (200 ng), Texas Red-labeled
reference DNA (200 ng), and human Cot-1 DNA (20 µg) were
denatured at 37 °C for 5 min and applied to denatured normal metaphase
slides. Hybridization was at 37 °C for 2 days. After washing,
the slides were counterstained with 0.15 µg/ml 4,6-diamidino-2-phenylindole
in an anti-fade solution. A second hybridization was performed
for all tumor samples using fluorescein-labeled reference DNA
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
green:red fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the green:red fluorescence intensity ratio for the
entire metaphase and background correction. Chromosome identification
was performed based on 4,6-diamidino-2-phenylindole banding
patterns. Only images showing uniform high intensity fluorescence
with minimal background staining were analyzed. All
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 
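
As an illustration of how averaged ratio profiles can be turned into gain and loss calls, the sketch below takes the green:red ratios measured at one chromosomal position in several images, computes their mean and standard deviation, and flags the position when the mean plus or minus one standard deviation excludes a ratio of 1. That rule is borrowed from the legend of Fig. 2; whether the original image analysis software applied it in exactly this form, and whether it used the sample standard deviation, are assumptions. All names and example values are hypothetical.

```python
from statistics import mean, stdev


def summarise_ratios(ratios):
    """Return (mean, sample standard deviation) of the ratios at one position."""
    if len(ratios) < 2:
        raise ValueError("need at least two ratio measurements")
    return mean(ratios), stdev(ratios)


def call_position(ratios, normal=1.0):
    """Classify a position as 'gain', 'loss' or 'normal'.

    A call is made only when the interval mean +/- one standard
    deviation lies entirely above or below the normal ratio.
    """
    m, sd = summarise_ratios(ratios)
    if m - sd > normal:
        return "gain"
    if m + sd < normal:
        return "loss"
    return "normal"


# Hypothetical ratios from four images of the same chromosome position.
print(call_position([1.9, 2.1, 2.0, 2.2]))
print(call_position([1.0, 1.1, 0.9, 1.0]))
print(call_position([0.6, 0.7, 0.65, 0.7]))
```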

RESULTS 

Comparative Genomic Hybridization— The CGH analysis 
identified a number of chromosomal gains and losses in the 






Gene Copy Numbers, Transcripts, and Protein Levels 



Table I

Correlation between alterations detected by CGH and by expression monitoring
Top, CGH used as independent variable (if CGH alteration, what expression ratio was found); bottom, altered expression used as
independent variable (if expression alteration, what CGH deviation was found).

Tumor 733 vs. 335: CGH alterations; Expression change clusters; Concordance
Tumor 827 vs. 532: CGH alterations; Expression change clusters
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10 Loss 



10 Up-regulation 

0 Down-regulation 

3 No change 
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two invasive tumors (stage pT1, TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+, 9q-,
and Y-, respectively. Both invasive tumors showed changes
(1q22-24+, 2q14.1-qter-, 3q12-q13.3-, 6q12-q22-,
9q34+, 11q12-q13+, 17+, and 20q11.2-q12+) that are typical
for their disease stage, as well as additional alterations,
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. 1A) and
20q12 in TCC 827 (Fig. 1B).

mRNA Expression in Relation to DNA Copy Number— The
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Approximately
1,800 genes that yielded a signal on the arrays
were searched in the Unigene and Genemap data bases for
chromosomal location, and those with a known location
(1096) were plotted as bars covering their purported locus. In
that way it was possible to construct a graphic presentation of
DNA copy number and relative mRNA levels along the individual
chromosomes (Fig. 1).

For each mRNA a ratio was calculated between the level in 
the invasive versus the non-invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas containing
only one gene were excluded from the calculations. The
resolution of the CGH method is very low, and some of the
outlier data may arise because the boundaries of
the chromosomal aberrations are not known at high resolution.
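
The scoring rules described here and in the legend of Fig. 1 can be sketched as follows: a gene is informative only above a 2-fold change, a transcript absent in one sample and present in the other counts as more than 2-fold, and a region needs at least two genes. Representing an absent call as None, the "mixed" label for regions that meet none of the stated rules, and the tie-breaking between up and down are assumptions added for the example. The signal values are taken from the figure legends and Discussion of this paper (cytokeratin 13 at 623 versus 15659 units, and a transcript going from absent to 2670 units); the third pair is hypothetical.

```python
from typing import List, Optional


def classify_gene(invasive: Optional[float], non_invasive: Optional[float],
                  fold: float = 2.0) -> str:
    """Classify one transcript; None stands for an 'absent' call (assumption)."""
    if invasive is None and non_invasive is None:
        return "unchanged"
    if non_invasive is None:
        return "up"      # absent -> present counts as more than 2-fold
    if invasive is None:
        return "down"    # present -> absent counts as more than 2-fold
    ratio = invasive / non_invasive
    if ratio > fold:
        return "up"
    if ratio < 1.0 / fold:
        return "down"
    return "unchanged"


def classify_region(calls: List[str]) -> Optional[str]:
    """Summarise a chromosomal region; regions with one gene are excluded."""
    n = len(calls)
    if n < 2:
        return None
    up, down = calls.count("up"), calls.count("down")
    if 2 * up >= n and up > down:
        return "up"
    if 2 * down >= n and down > up:
        return "down"
    if 2 * (n - up - down) > n:
        return "unchanged"
    return "mixed"  # none of the stated rules applies (assumption)


calls = [
    classify_gene(2670.0, None),     # absent to 2670 units
    classify_gene(623.0, 15659.0),   # cytokeratin 13, TCC 827 vs. 532
    classify_gene(1200.0, 1000.0),   # hypothetical 1.2-fold difference
]
print(calls, classify_region(calls))
```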

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by decreased
expression in several cases, and were often registered
as having unaltered RNA levels (Table I, top). The inability
to detect RNA expression changes in these cases was not
because of fewer genes mapping to the lost regions (data not 
shown). 
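
The first set of calculations amounts to a conditional frequency: among regions with a given CGH call, what fraction shows a given expression call. A minimal sketch follows. The counts 10, 0, and 3 are taken from the surviving fragment of Table I; that they belong to the gained regions of TCC 733 is an assumption, although 10 of 13 is consistent with the 77% quoted above. The function name and the data layout are hypothetical.

```python
from collections import Counter


def concordance(pairs, cgh_call, expression_call):
    """Fraction of regions with `cgh_call` that show `expression_call`.

    `pairs` is a list of (cgh_call, expression_call) tuples, one per region.
    """
    selected = [expr for cgh, expr in pairs if cgh == cgh_call]
    if not selected:
        raise ValueError(f"no regions with CGH call {cgh_call!r}")
    return Counter(selected)[expression_call] / len(selected)


# Assumed layout: 13 gained regions, 10 up-regulated, 0 down-regulated,
# and 3 without a change in expression.
regions = [("gain", "up")] * 10 + [("gain", "unchanged")] * 3
print(f"{100 * concordance(regions, 'gain', 'up'):.0f}%")
```

Swapping the roles of the two columns gives the second set of calculations, in which expression change is the independent variable.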

In the second set of calculations we selected expression
alterations above 2-fold as the independent variable and estimated
the frequency of CGH alterations in these areas. As
above, we found that increased transcript expression correlated
with gain of chromosomal material (TCC 733, 69% and
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 

[Fig. 2 plot omitted: panels "Tumor 827 versus 532" and "Tumor 733 versus 335"; horizontal axis categories "Expression changes detected" and "Expression changes not detected".]

Fig. 2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array
monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 (▲) and 733 (♦) and their non-invasive
counterparts 532 and 335. The expression change was taken from the Expression line to the right in Fig. 1, which depicts the resulting
expression change for a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down-regulated
to be scored as an expression change. All chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the
ratio value of one were included.

Furthermore, as a control we looked at areas with no alteration
in expression. No alteration was detected by CGH in
most of these areas (TCC 733, 60% and TCC 827, 81%; see
Table I, bottom). Because the ability to observe reduced or
increased mRNA expression clustering to a certain chromosomal
area clearly reflected the extent of copy number
changes, we plotted the maximum CGH aberrations in the
regions showing CGH changes against the ability to detect a
change in mRNA expression as monitored by the oligonucleotide
arrays (Fig. 2). For both tumors TCC 733 (p < 0.015) and
TCC 827 (p < 0.00003) a highly significant correlation was
observed between the level of CGH ratio change (reflecting
the DNA copy number) and alterations detected by the array
based technology (Fig. 2). Similar data were obtained when
areas with altered expression were used as independent variables.
These areas correlated best with CGH when the CGH
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 
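
The asymmetry between losses and gains can be made concrete with a small worked example. It assumes a strictly proportional gene dosage model (expression ratio equal to tumor copy number divided by a reference of two copies) and a strict "more than 2-fold" cut-off; both are simplifications introduced for illustration, and the function names are hypothetical.

```python
def expected_ratio(copies: int, reference_copies: int = 2) -> float:
    """Expression ratio predicted by a purely proportional dosage model."""
    return copies / reference_copies


def is_detectable(ratio: float, fold: float = 2.0) -> bool:
    """True only if the change is strictly larger than `fold` in either direction."""
    return ratio > fold or ratio < 1.0 / fold


for copies in (1, 3, 4, 6):
    ratio = expected_ratio(copies)
    print(copies, ratio, is_detectable(ratio))
```

Under these assumptions, loss of one allele gives a ratio of 0.5, which sits exactly on the cut-off and is not scored, whereas a gain has no upper bound: six copies give a 3-fold ratio that is scored.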

Microsatellite-based Detection of Minor Areas of Losses—
In TCC 733, several chromosomal areas exhibiting DNA
amplification were preceded or followed by areas with a normal
CGH but reduced mRNA expression (see Fig. 1, TCC 733
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and
10q22). To determine whether these results were because of
undetected loss of chromosomal material in these regions or



because of other non-structural mechanisms regulating transcription,
we examined two microsatellites positioned at chromosome
1q25-32 and two at chromosome 2p22. Loss of
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1A). As indicated above, the mRNA decrease observed in
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24
and in TCC 827 at chromosome 11p15.5, 12p11, 15q11.2,
and 18q12 was also examined for chromosomal losses using
microsatellites positioned as close as possible to the gene loci
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Fig. 3. Microsatetlite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected
(a) by D1S215 close to Hu class I histocompatibility antigen (gene
number 38 in Fig. 1), (b) by D1S2735 close to cathepsin E (gene
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close
to general β-spectrin (gene number 11 in Fig. 1) and of (d) tumor 827
showing loss of heterozygosity at chromosome 18q12 by D18S1118
close to mitochondrial 3-oxoacyl-coenzyme A thiolase (gene number
12 in Fig. 1). The upper curves show the electropherogram obtained
from normal DNA from leukocytes (N), and the lower curves show the
electropherogram from tumor DNA (T). In all cases one allele is
partially lost in the tumor amplicon.

showing reduced mRNA transcripts. Only the microsatellite
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels— 
2D-PAGE analysis, in combination with Coomassie Brilliant
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fig. 4. Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in
three groups, unaltered in level or up- or down-regulated (horizontal
axis). The mRNA ratio as determined by oligonucleotide arrays was
plotted for each gene (vertical axis). ▲, mRNAs that were scored as
present in both tumors used for the ratio calculation; △, mRNAs that
were scored as absent in the invasive tumors (along horizontal axis) or
as absent in non-invasive reference (top of figure). Two different
scalings were used to exclude scaling as a confounder; TCCs 827
and 532 (▲△) were scaled with background suppression, and TCCs
733 and 335 (●○) were scaled without suppression. Both comparisons
showed highly significant (p < 0.005) differences in mRNA ratios
left), phosphoglucomutase 1 t glutathione transferase class p number 
4, fatty acid-binding protein homotogue, cytokeratin 15, and cyto- 
keratin 13; 8 (from left), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratin 13, and calcyclin; Ctfrom left), a-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-e, and 
pre-mRNA splicing factor, 0, mesothellal keratin K7 (type II); E (from 
top), glutathione S-transferase-tr and mesothelial keratin K7 (type II); 
F (from top and /eft), adenylyl cyclase-associated protein, E-cadherin, 
keratin 19, calgtzzarin, phosphoglycerate mutase, annexin IV, cy- 
toskeletal yactin, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translationally controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na.K- 
ATPase 0-1 subunit; G, (from top and /eft). TCP20, calgizzarln, 70- 
kDa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 19, triosephosphate fsomerase, hnRNP F, liver glyc- 
eraldehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-77, and keratin 8; H (from left), plasma gelsolin, autoantigen cal- 
reticulin, thioredoxin, and NAD + -dependent 15 hydroxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxylase /J-subunit, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-blphos- 
phatase; J annexin II; K, annexin IV; L (from top and left), 90-kDa heat 
shock protein, prolyl 4-hydroxytase 0-subunit, a-enolase, GRP 78, 
cyclophllin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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keratins encoded by genes on chromosome 17 (Fig, 5) the 
analyzed proteins did not belong to a particular family. 26 well
focused proteins whose genes had a known chromosomal
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

11 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also
showed corresponding changes in the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the
results showed that there is a clear individual regulation of the
mRNA expression of single genes, which in some cases was
superimposed by a DNA copy number effect. In most cases,
genes located in chromosomal areas with gains exhibited
increased mRNA expression, whereas areas showing
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 

Table II

Proteins whose expression level correlates with both mRNA and gene dose changes

Protein                 Chromosomal location   Tumor TCC   CGH alteration   Transcript alteration(a)   Protein alteration
Annexin II              1q21                   733         Gain             Abs to Pres(a)             Increase
Annexin IV              2p13                   733         Gain             3.9-Fold up                Increase
Cytokeratin 17          17q12-q21              827         Gain             3.8-Fold up                Increase
Cytokeratin 20          17q21.1                827         Gain             5.6-Fold up                Increase
(PA-)FABP               8q21.2                 827         Loss             10-Fold down               Decrease
FBP1                    9q22                   827         Gain             2.3-Fold up                Increase
Plasma gelsolin         9q31                   827         Gain             Abs to Pres                Increase
Heat shock protein 28   15q12-q13              827         Loss             2.5-Fold up                Decrease
Prohibitin              17q21                  827/733     Gain             3.7-/2.5-Fold up(b)        Increase
Prolyl-4-hydroxyl       17q25                  827/733     Gain             5.7-/1.6-Fold up           Increase
hnRNP B1                7p15                   827         Loss             2.5-Fold down              Decrease

(a) Abs, absent; Pres, present.
(b) In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733.




Fig. 5. Comparison of protein and transcript levels in invasive
and non-invasive TCCs. The upper part of the figure shows a 2D gel
(left) and the oligonucleotide array (right) of TCC 532. The red rectangles
on the upper gel highlight the areas that are compared below.
Identical areas of 2D gels of TCCs 532 and 827 are shown below.
Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC
827 (red annotation). The tile on the array containing probes for
cytokeratin 15 is enlarged below the array (red arrow) from TCC 532
and is compared with TCC 827. The upper row of squares in each tile
corresponds to perfect match probes; the lower row corresponds to
mismatch probes containing a mutation (used for correction for
unspecific binding). Absence of signal is depicted as black, and the
higher the signal the lighter the color. A high transcript level was
detected in TCC 532 (6151 units) whereas a much lower level was
detected in TCC 827 (absence of signals). For cytokeratin 13, a high
transcript level was also present in TCC 532 (15659 units), and a
much lower level was present in TCC 827 (623 units). The 2D gels at
the bottom of the figure (left) show levels of PA-FABP and adipocyte-FABP
in TCCs 335 and 733 (invasive), respectively. Both proteins are
down-regulated in the invasive tumor. To the right we show the array
tiles for the PA-FABP transcript. A medium transcript level was detected
in the case of TCC 335 (1277 units) whereas very low levels
were detected in TCC 733 (166 units). IEF, isoelectric focusing.
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of transcript,
respectively. Some of these transcripts could not be
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors (e.g. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled
methylation in tumor cells (17-19). Thus, it may be possible 
that in chromosomes with increased DNA copy numbers two 
or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p-, 9q-, 1q+, Y-
(2, 6), and in pT1 tumors, 2q-, 11p-, 11q-, 1q+, 5p+, 8q+,
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here
showed similar aberrations such as 9p- and 9q22-q33- and
9q- and Y-, respectively. Likewise, the two minimal invasive
pT1 tumors showed aberrations that are commonly seen at
that stage, and TCC 827 had a remarkable resemblance to the
commonly seen pattern of losses and gains, such as 1q22-24
amplification (seen in both tumors), 11q14-q22 loss, the latter
often linked to 17q+ (both tumors), and 1q+ and 9p-, often
linked to 20q+ and 11q13+ (both tumors) (7-9). These observations
indicate that the pairs of tumors used in this study
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases. It seems likely that
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism behind
chromosomal aneuploidy and cannot predict whether
chromosomal gains will be transcribed to a larger extent than
the two native alleles. A mechanism such as genetic imprinting has
an impact on the expression level in normal cells and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably represent successive steps in
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicates that
bladder tumors are relatively homogeneous, a notion recently
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translational 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as determined
by arrays and 2D-PAGE (25), and a moderate correlation
was recently reported by Ideker et al. (26) in yeast.
Interestingly, our study revealed a much better correlation
between gained chromosomal areas and increased mRNA
levels than between loss of chromosomal areas and reduced
mRNA levels. In general, the level of CGH change determined
the ability to detect a change in transcript. One possible
explanation could be that by losing one allele the change in
mRNA level is not so dramatic as compared with gain of
material, which can be rather unlimited and may lead to a
severalfold increase in gene copy number resulting in a much
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 

In eleven cases we found a significant correlation between
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 

ABSTRACT

Genetic changes underlie tumor progression and may lead to cancer- 
specific expression of critical genes. Over 1100 publications have de- 
scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations in cancer, but very few of the genes 
affected are known. Here, we performed high-resolution CGH analysis on
cDNA microarrays in breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent amplicons, ranging in size from 0.2 to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes had a substantial impact on gene expression, with 44% of the 
highly amplified genes showing overexpression and 10.5% of the highly 
overexpressed genes being amplified. Statistical analysis with random
permutation tests identified 270 genes whose expression levels across 14
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, including the HOXB7 gene, 
the presence of which in a novel amplicon at 17q21.3 was validated in
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose overexpression is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited. 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over
20 recurrent regions of DNA amplification have been mapped in
breast cancer by CGH 5 (9, 10). However, these amplicons are often
large and poorly defined, and their impact on gene expression remains
unknown.

Fig. 1. Impact of gene copy number on global gene expression levels. A, percentage of
over- and underexpressed genes (Y axis) according to copy number ratios (X axis).
Threshold values used for over- and underexpression were >2.184 (global upper 7% of
the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). B, percentage
of amplified and deleted genes according to expression ratios. Threshold values for
amplification and deletion were >1.5 and <0.7.

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes whose mRNA expression
is most significantly associated with amplification of the corresponding
genomic template.

5 The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluorescence
in situ hybridization; RT-PCR, reverse transcription-PCR.

GENE EXPRESSION PATTERNS IN BREAST CANCER

Fig. 2. Genome-wide copy number and expression analysis in the MCF-7 breast cancer cell line. A, chromosomal CGH analysis of MCF-7. The copy number ratio profile (blue
line) across the entire genome from 1p telomere to Xq telomere is shown along with ±1 SD (orange lines). The black horizontal line indicates a ratio of 1.0; red line, a ratio of 0.8;
and green line, a ratio of 1.2. B-C, genome-wide copy number analysis in MCF-7 by CGH on cDNA microarray. The copy number ratios were plotted as a function of the position
of the cDNA clones along the human genome. In B, individual data points are connected with a line, and a moving median of 10 adjacent clones is shown. Red horizontal line, the
copy number ratio of 1.0. In C, individual data points are labeled by color coding according to cDNA expression ratios. The bright red dots indicate the upper 2%, and dark red dots,
the next 5% of the expression ratios in MCF-7 cells (overexpressed genes); bright green dots indicate the lowest 2%, and dark green dots, the next 5% of the expression ratios
(underexpressed genes); the rest of the observations are shown with black crosses. The chromosome numbers are shown at the bottom of the figure, and chromosome boundaries are
indicated with a dashed line.

MATERIALS AND METHODS 

Breast Cancer Cell Lines. Fourteen breast cancer cell lines (BT-20, BT- 
474, HCC1428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Microarrays. The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (11-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 μg of genomic DNA from breast cancer cell lines and normal
human WBCs were digested for 14-18 h with AluI and RsaI (Life Technologies,
Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six
μg of digested cell line DNAs were labeled with Cy3-dUTP (Amersham
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and 
posthybridization washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla, CA) was used in all experiments. Forty μg of reference RNA were
labeled with Cy3-dUTP and 3.5 μg of test mRNA with Cy5-dUTP, and the
labeled cDNAs were hybridized on microarrays as described (13, 15). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction,
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spotted four times onto the 
array. Low quality measurements (i.e., copy number data with mean reference 
intensity <100 fluorescent units, and expression data with both test and 
reference intensity <100 fluorescent units and/or with spot size <50 units)
were excluded from the analysis and were treated as missing values. The
distributions of fluorescence ratios were used to define cutpoints for increased/ 
decreased copy number. Genes with CGH ratio >1.43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5%) were considered to be
deleted. 
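
As an illustration only (not the authors' code), the percentile-based cutpoints described above can be sketched with NumPy. The function names `copy_number_cutpoints` and `classify` are hypothetical, and the 5th/95th percentile defaults simply restate the "upper 5%" and "lower 5%" rule from the text; missing values are assumed to be stored as NaN.

```python
import numpy as np

def copy_number_cutpoints(ratios, lower_pct=5.0, upper_pct=95.0):
    """Return (deletion_cutoff, amplification_cutoff) from pooled CGH ratios.

    Missing measurements (NaN) are dropped before the percentiles are taken.
    """
    r = np.asarray(ratios, dtype=float)
    r = r[~np.isnan(r)]
    return np.percentile(r, lower_pct), np.percentile(r, upper_pct)

def classify(ratios, del_cut, amp_cut):
    """Label each ratio as 'amplified', 'deleted', 'normal', or 'missing'."""
    r = np.asarray(ratios, dtype=float)
    calls = np.full(r.shape, "normal", dtype=object)
    calls[r > amp_cut] = "amplified"
    calls[r < del_cut] = "deleted"
    calls[np.isnan(r)] = "missing"
    return calls
```

With the cutpoints reported in the text, `classify(ratios, 0.73, 1.43)` would reproduce the amplified/deleted labels for a vector of CGH ratios.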

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, >1.43) and 0 for no amplification.
Amplification was correlated with gene expression using the signal-to-noise
statistics (1). We calculated a weight, $w_g$, for each gene as follows:

$$w_g = \frac{m_{g1} - m_{g0}}{\sigma_{g1} + \sigma_{g0}}$$

where $m_{g1}$, $\sigma_{g1}$ and $m_{g0}$, $\sigma_{g0}$ denote the means and SDs for the expression
levels for amplified and nonamplified cell lines, respectively. To assess the
statistical significance of each weight, we performed 10,000 random permutations
of the label vector. The probability that a gene had a larger or equal
weight by random permutation than the original weight was denoted by α. A
low α (<0.05) indicates a strong association between gene expression and
amplification.
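
A minimal sketch of the signal-to-noise weight and its permutation test, written for a single gene, may make the procedure concrete. It is not the authors' implementation: the function names are hypothetical, the use of the sample SD (`ddof=1`) is an assumption because the text only says "SDs", and the random seed is arbitrary.

```python
import numpy as np

def signal_to_noise(expr, amplified):
    """Weight for one gene: difference of group means over the sum of group SDs."""
    expr = np.asarray(expr, dtype=float)
    amplified = np.asarray(amplified, dtype=bool)
    amp, non = expr[amplified], expr[~amplified]
    # Assumption: sample SD (ddof=1). Each group then needs at least two
    # cell lines, otherwise the SD (and the weight) is NaN.
    return (amp.mean() - non.mean()) / (amp.std(ddof=1) + non.std(ddof=1))

def permutation_alpha(expr, amplified, n_perm=10_000, seed=0):
    """Return (observed weight, alpha).

    alpha is the fraction of random label permutations whose weight is
    larger than or equal to the observed weight.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(amplified, dtype=bool)
    observed = signal_to_noise(expr, labels)
    hits = 0
    for _ in range(n_perm):
        if signal_to_noise(expr, rng.permutation(labels)) >= observed:
            hits += 1
    return float(observed), hits / n_perm
```

Applied gene by gene to the median-centered log ratios across the 14 cell lines, genes with a returned alpha below 0.05 would correspond to those called significant in the text.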

Genomic Localization of cDNA Clones and Amplicon Mapping. Each
cDNA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 141. 6 A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were



6 Internet address: http://research.nhgri.nih.gov/microarray/downloadable_cdna.html.

7 Internet address: www.genome.ucsc.edu.

Table 1 Summary of independent amplicons in 14 breast cancer cell lines by
CGH microarray

Location             Start (Mb)   End (Mb)   Size (Mb)
1p13                 132.79       132.94      0.2
1q21                 173.92       177.25      3.3
1q22                 179.28       179.57      0.3
3p14                  71.94        74.66      2.7
7p12.1-7p11.2         55.62        60.95      5.3
7q31                 125.73       130.96      5.2
7q32                 140.01       140.68      0.7
8q21.11-8q21.13       86.45        92.46      6.0
8q21.3                98.45       103.05      4.6
8q23.3-8q24.14       129.88       142.15     12.3
8q24.22              151.21       152.16      1.0
9p13                  38.65        39.25      0.6
13q22-q31             77.15        81.38      4.2
                      86.70        87.62      0.9
17q11                 29.30        30.85      1.6
17q12-q21.2           39.79        42.80      3.0
17q21.32-q21.33       52.47        55.80      3.3
17q22-q23.3           63.81        69.70      5.9
17q23.3-q24.3         69.93        74.99      5.1
19q13                 40.63        41.40      0.8
20q11.22              34.59        35.85      1.3
20q13.12              44.00        45.62      1.6
20q13.12-q13.13       46.45        49.43      3.0
20q13.2-q13.32        51.32        59.12      7.8


CGH were validated, with 1q21, 17q12-q21.2, 17q22-q23, 20q13.1,
and 20q13.2 regions being most commonly amplified. Furthermore,
the boundaries of these amplicons were precisely delineated. In addition,
novel amplicons were identified at 9p13 (38.65-39.25 Mb),
and 17q21.3 (52.47-55.80 Mb).

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at 1p13, 17q22-q23, and 20q13 were highly overexpressed.
A view of chromosome 7 in the MDA-468 cell line
implicates EGFR as the most highly overexpressed and amplified
gene at 7p11-p12 (Fig. 3A). In BT-474, the two known amplicons
at 17q12 and 17q22-q23 contained numerous highly overexpressed
genes (Fig. 3B). In addition, several genes, including the
homeobox genes HOXB2 and HOXB7, were highly amplified in a
previously undescribed independent amplicon at 17q21.3. HOXB7
was systematically amplified (as validated by FISH, Fig. 3B, inset)
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC812, and ZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonamplified clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 
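
The amplicon definition used here (a CGH ratio >2.0 in at least two adjacent clones in two or more cell lines, or in at least three adjacent clones in a single cell line) can be sketched as follows. This is one possible reading of the rule, not the authors' code: the function name `call_amplicons` is hypothetical, the input is assumed to be a clones-by-cell-lines matrix for one chromosome with clones sorted by genomic position, "two or more cell lines" is interpreted as the same adjacent pair being elevated in each line, and the extension to neighboring nonamplified clones is left out.

```python
import numpy as np

def call_amplicons(ratios, threshold=2.0):
    """Return (first_clone_index, last_clone_index) runs meeting the rule.

    ratios: 2-D array, rows = clones in genomic order, columns = cell lines.
    NaN entries compare as False and therefore never count as elevated.
    """
    high = np.asarray(ratios, dtype=float) > threshold
    n = high.shape[0]
    in_amp = np.zeros(n, dtype=bool)

    if n >= 2:
        # pair[i, j]: clones i and i+1 are both elevated in cell line j
        pair = high[:-1] & high[1:]
        for i in np.flatnonzero(pair.sum(axis=1) >= 2):
            in_amp[i:i + 2] = True
    if n >= 3:
        # triple[i, j]: clones i, i+1, i+2 are all elevated in cell line j
        triple = high[:-2] & high[1:-1] & high[2:]
        for i in np.flatnonzero(triple.any(axis=1)):
            in_amp[i:i + 3] = True

    # Merge consecutive flagged clones into index intervals.
    runs, start = [], None
    for i, flag in enumerate(in_amp):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, n - 1))
    return runs
```

The returned index pairs could then be mapped to the base-pair positions of the corresponding clones to obtain start and end coordinates.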

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RP11-361K8 was labeled
with SpectrumOrange (Vysis, Downers Grove, IL), and SpectrumOrange-labeled
probe for EGFR was obtained from Vysis. SpectrumGreen-
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(18). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the NIH. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 
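
The FISH scoring rule above (a 2-fold or higher increase in test probe signals relative to centromere signals in at least 10% of tumor cells) can be written as a small predicate. This is a sketch under stated assumptions: the function name and the per-cell input format are hypothetical, and cells with no centromere signal are skipped, which the text does not specify.

```python
def is_amplified(test_signals, centromere_signals, fold=2.0, min_fraction=0.10):
    """Score one specimen from per-cell signal counts.

    test_signals[i] and centromere_signals[i] are the counts for cell i.
    Cells with zero centromere signals are ignored (assumption).
    """
    pairs = [(t, c) for t, c in zip(test_signals, centromere_signals) if c > 0]
    if not pairs:
        return False
    positive = sum(1 for t, c in pairs if t / c >= fold)
    return positive / len(pairs) >= min_fraction
```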

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, WI) with 10 ng of mRNA 
as a template. HOXB7 primers were 5'-GAGCAGAGGGACTCGGACTT-3'
and 5'-GCGTCAGGTAGCGATTGTAG-3'. 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e., belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. 1B). Low-level copy number
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 
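
The percentages quoted above are conditional fractions: among genes whose CGH ratio falls in a given bin, the share whose expression ratio lies in the global upper 7%. A minimal sketch of that calculation is given below; the function name and arguments are hypothetical, and pooling all non-missing gene measurements to set the expression cutoff is an assumption about how "global" was defined.

```python
import numpy as np

def fraction_overexpressed(cgh, expr, cgh_min, cgh_max=np.inf, upper_pct=93.0):
    """Fraction of genes in a CGH-ratio bin that are overexpressed.

    A gene counts as overexpressed when its expression ratio exceeds the
    upper_pct percentile of all non-missing expression ratios
    (93.0 corresponds to the global upper 7%).
    """
    cgh = np.asarray(cgh, dtype=float)
    expr = np.asarray(expr, dtype=float)
    ok = ~np.isnan(cgh) & ~np.isnan(expr)
    cgh, expr = cgh[ok], expr[ok]
    cutoff = np.percentile(expr, upper_pct)
    in_bin = (cgh > cgh_min) & (cgh <= cgh_max)
    if not in_bin.any():
        return float("nan")
    return float((expr[in_bin] > cutoff).mean())
```

Calling the function with `cgh_min=2.5` would give the fraction for highly amplified transcripts, and a bin such as `cgh_min=0.7, cgh_max=1.5` would give the comparison value for genes with normal copy number.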

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 11,994 cDNAs (86.8%) were used to plot copy
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 

Fig. 3. Annotation of gene expression data on CGH microarray profiles. A, genes in the
7p11-p12 amplicon in the MDA-468 cell line are highly expressed (red dots) and include
the EGFR oncogene. B, several genes in the 17q12, 17q21.3, and 17q23 amplicons in the
BT-474 breast cancer cell line are highly overexpressed (red) and include the HOXB7
gene. The data labels and color coding are as indicated for Fig. 2C. Insets show
chromosomal CGH profiles for the corresponding chromosomes and validation of the
increased copy number by interphase FISH using EGFR (red) and chromosome 7
centromere probe (green) to MDA-468 (A) and HOXB7-specific probe (red) and chromosome
17 centromere (green) to BT-474 cells (B).

Fig. 4. List of 50 genes with a statistically
significant correlation (α value <0.05) between
gene copy number and gene expression. Name,
chromosomal location, and the α value for each
gene are indicated. The genes have been ordered
according to their position in the genome. The color
maps on the right illustrate the copy number and
expression ratio patterns in the 14 cell lines. The
key to the color code is shown at the bottom of the
graph. Gray squares, missing values. The complete
list of 270 genes is shown in supplemental Fig. B.

amplification was validated to be present in 10.2% of 363 primary
breast cancers by FISH to a tissue microarray and was associated
with poor prognosis of the patients (P < 0.001).

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



8 Internet address: http://www.geneontology.org/.

9 Internet address: http://www.ncbi.nlm.nih.gov/entrez.
level copy number increase. Low-level copy number gains and losses
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influence
on many genes. Our results in breast cancer extend the recent
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24).

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the homeobox gene region at 17q21.3 and led to the overexpression
of the HOXB7 and HOXB2 genes. The homeodomain
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing ~2% of all genes on the array), including not only
previously described amplified genes, such as HER-2, MYC,
EGFR, ribosomal protein s6 kinase, and AIB3, but also numerous
novel genes such as NRAS-related gene (1p13), syndecan-2 (8q22),
and bone morphogenic protein (20q13.1), whose activation by
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA microarrays
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q21.3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 

REFERENCES 

1. Golub, T. R., Slonim, D. K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J. P.,
Coller, H., Loh, M. L., Downing, J. R., Caligiuri, M. A., Bloomfield, C. D., and
Lander, E. S. Molecular classification of cancer: class discovery and class prediction
by gene expression monitoring. Science (Wash. DC), 286: 531-537, 1999.

2. Alizadeh, A. A., Eisen, M. B., Davis, R. E., Ma, C., Lossos, I. S., Rosenwald, A.,
Boldrick, J. C., Sabet, H., Tran, T., Yu, X., et al. Distinct types of diffuse large B-cell
lymphoma identified by gene expression profiling. Nature (Lond.), 403: 503-511,
2000.

3. Bittner, M., Meltzer, P., Chen, Y., Jiang, Y., Seftor, E., Hendrix, M., Radmacher, M.,
Simon, R., Yakhini, Z., Ben-Dor, A., et al. Molecular classification of cutaneous
malignant melanoma by gene expression profiling. Nature (Lond.), 406: 536-540,
2000.

4. Perou, C. M., Sorlie, T., Eisen, M. B., van de Rijn, M., Jeffrey, S. S., Rees, C. A.,
Pollack, J. R., Ross, D. T., Johnsen, H., Akslen, L. A., et al. Molecular portraits of
human breast tumours. Nature (Lond.), 406: 747-752, 2000.

5. Dhanasekaran, S. M., Barrette, T. R., Ghosh, D., Shah, R., Varambally, S., Kurachi,
K., Pienta, K. J., Rubin, M. A., and Chinnaiyan, A. M. Delineation of prognostic
biomarkers in prostate cancer. Nature (Lond.), 412: 822-826, 2001.

6. Sorlie, T., Perou, C. M., Tibshirani, R., Aas, T., Geisler, S., Johnsen, H., Hastie, T.,
Eisen, M. B., van de Rijn, M., Jeffrey, S. S., et al. Gene expression patterns of breast
carcinomas distinguish tumor subclasses with clinical implications. Proc. Natl. Acad.
Sci. USA, 98: 10869-10874, 2001.

7. Ross, J. S., and Fletcher, J. A. The HER-2/neu oncogene: prognostic factor, predictive
factor and target for therapy. Semin. Cancer Biol., 9: 125-138, 1999.

8. Arteaga, C. L. The epidermal growth factor receptor: from mutant oncogene in
nonhuman cancers to therapeutic target in human neoplasia. J. Clin. Oncol., 19:
32-40, 2001.

9. Knuutila, S., Bjorkqvist, A. M., Autio, K., Tarkkanen, M., Wolf, M., Monni, O.,
Szymanska, J., Larramendy, M. L., Tapper, J., Pere, H., El-Rifai, W., et al. DNA copy
number amplifications in human neoplasms: review of comparative genomic hybridization
studies. Am. J. Pathol., 152: 1107-1123, 1998.

10. Knuutila, S., Autio, K., and Aalto, Y. Online access to CGH data of DNA sequence
copy number changes. Am. J. Pathol., 157: 689, 2000.

11. DeRisi, J., Penland, L., Brown, P. O., Bittner, M. L., Meltzer, P. S., Ray, M., Chen,
Y., Su, Y. A., and Trent, J. M. Use of a cDNA microarray to analyse gene expression
patterns in human cancer. Nat. Genet., 14: 457-460, 1996.

12. Shalon, D., Smith, S. J., and Brown, P. O. A DNA microarray system for analyzing
complex DNA samples using two-color fluorescent probe hybridization. Genome
Res., 6: 639-645, 1996.

13. Mousses, S., Bittner, M. L., Chen, Y., Dougherty, E. R., Baxevanis, A., Meltzer, P. S.,
and Trent, J. M. Gene expression analysis by cDNA microarrays. In: F. J. Livesey and
S. P. Hunt (eds.), Functional Genomics, pp. 113-137. Oxford: Oxford University
Press, 2000.

14. Pollack, J. R., Perou, C. M., Alizadeh, A. A., Eisen, M. B., Pergamenschikov, A.,
Williams, C. F., Jeffrey, S. S., Botstein, D., and Brown, P. O. Genome-wide analysis
of DNA copy-number changes using cDNA microarrays. Nat. Genet., 23: 41-46,
1999.

15. Monni, O., Bärlund, M., Mousses, S., Kononen, J., Sauter, G., Heiskanen, M.,
Paavola, P., Avela, K., Chen, Y., Bittner, M. L., and Kallioniemi, A. Comprehensive
copy number and gene expression profiling of the 17q23 amplicon in human breast
cancer. Proc. Natl. Acad. Sci. USA, 98: 5711-5716, 2001.

16. Chen, Y., Dougherty, E. R., and Bittner, M. L. Ratio-based decisions and the
quantitative analysis of cDNA microarray images. J. Biomed. Optics, 2: 364-374,
1997.

17. Bärlund, M., Forozan, F., Kononen, J., Bubendorf, L., Chen, Y., Bittner, M. L.,
Torhorst, J., Haas, P., Bucher, C., Sauter, G., et al. Detecting activation of ribosomal
protein S6 kinase by complementary DNA and tissue microarray analysis. J. Natl.
Cancer Inst., 92: 1252-1259, 2000.

18. Andersen, C. L., Hostetter, G., Grigoryan, A., Sauter, G., and Kallioniemi, A.
Improved procedure for fluorescence in situ hybridization on tissue microarrays.
Cytometry, 45: 83-86, 2001.

19. Kauraniemi, P., Bärlund, M., Monni, O., and Kallioniemi, A. New amplified and
highly expressed genes discovered in the ERBB2 amplicon in breast cancer by cDNA
microarrays. Cancer Res., 61: 8235-8240, 2001.

20. Clark, J., Edwards, S., John, M., Flohr, P., Gordon, T., Maillard, K., Giddings, I.,
Brown, C., Bagherzadeh, A., Campbell, C., Shipley, J., Wooster, R., and Cooper,
C. S. Identification of amplified and expressed genes in breast cancer by comparative
hybridization onto microarrays of randomly selected cDNA clones. Genes Chromosomes
Cancer, 34: 104-114, 2002.

21. Varis, A., Wolf, M., Monni, O., Vakkari, M. L., Kokkola, A., Moskaluk, C., Frierson,
H., Powell, S. M., Knuutila, S., Kallioniemi, A., and El-Rifai, W. Targets of gene
amplification and overexpression at 17q in gastric cancer. Cancer Res., 62: 2625-
2629, 2002.

22. Hughes, T. R., Roberts, C. J., Dai, H., Jones, A. R., Meyer, M. R., Slade, D.,
Burchard, J., Dow, S., Ward, T. R., Kidd, M. J., Friend, S. H., and Marton, M. J.
Widespread aneuploidy revealed by DNA microarray expression profiling. Nat.
Genet., 25: 333-337, 2000.

23. Virtaneva, K., Wright, F. A., Tanner, S. M., Yuan, B., Lemon, W. J., Caligiuri, M. A.,
Bloomfield, C. D., de La Chapelle, A., and Krahe, R. Expression profiling reveals
fundamental biological differences in acute myeloid leukemia with isolated trisomy 8
and normal cytogenetics. Proc. Natl. Acad. Sci. USA, 98: 1124-1129, 2001.

24. Phillips, J. L., Hayward, S. W., Wang, Y., Vasselli, J., Pavlovich, C., Padilla-Nash,
H., Pezullo, J. R., Ghadimi, B. M., Grossfeld, G. D., Rivera, A., Linehan, W. M.,
Cunha, G. R., and Ried, T. The consequences of chromosomal aneuploidy on gene
expression profiles in a cell line model for prostate carcinogenesis. Cancer Res., 61:
8143-8149, 2001.

25. BSrlund, M., Tirkkoncn, M., Forozan, F., Tanner, M. M., Kallioniemi, 0- P., and 
Kallioniemi, A. Increased copy number 3t I7q22-q24 by CGH in breast cancer is due 
to high-level ampli ft cation of two separate regions. Genes Chromosomes Cancer, 20: 
372-376, 1997. 

26. Tanner, M. M., Tirkkonen, M., Kallioniemi, A., Isola, J., Kuukasjärvi, T., Collins, C.,
Kowbel, D., Guan, X. Y., Trent, J., Gray, J. W., Meltzer, P., and Kallioniemi, O. P.
Independent amplification and frequent co-amplification of three nonsyntenic regions
on the long arm of chromosome 20 in human breast cancer. Cancer Res., 56:
3441-3445, 1996.

27. Cillo, C., Faiella, A., Cantile, M., and Boncinelli, E. Homeobox genes and cancer.
Exp. Cell Res., 248: 1-9, 1999.

28. Cillo, C., Cantile, M., Faiella, A., and Boncinelli, E. Homeobox genes in normal and
malignant cells. J. Cell. Physiol., 188: 161-169, 2001.

29. Care, A., Silvani, A., Meccia, E., Mattia, G., Stoppacciaro, A., Parmiani, G., Peschle,
C., and Colombo, M. P. HOXB7 constitutively activates basic fibroblast growth
factor in melanomas. Mol. Cell. Biol., 16: 4842-4851, 1996.

30. Care, A., Silvani, A., Meccia, E., Mattia, G., Peschle, C., and Colombo, M. P.
Transduction of the SkBr3 breast carcinoma cell line with the HOXB7 gene induces
bFGF expression, increases cell proliferation and reduces growth factor dependence.
Oncogene, 16: 3285-3289, 1998.

31. Care, A., Felicetti, F., Meccia, E., Bottero, L., Parenza, M., Stoppacciaro, A., Peschle,
C., and Colombo, M. P. HOXB7: a key factor for tumor-associated angiogenic switch.
Cancer Res., 61: 6532-6539, 2001.

32. Naora, H., Yang, Y. Q., Montz, F. J., Seidman, J. D., Kurman, R. J., and Roden, R. B.
A serologically identified tumor antigen encoded by a homeobox gene promotes
growth of ovarian epithelial cells. Proc. Natl. Acad. Sci. USA, 98: 4060-4065, 2001.







Protocol 12

RNA Common Reference Sets 

Jonathan R. Pollack 

Department of Pathology, Stanford University School of Medicine, Stanford, California 94305 

To measure gene expression using microarrays, two different mRNA preparations are labeled, 
using different fluors, and cohybridized to a cDNA microarray. For each gene-specific ele- 
ment in the array, the ratio of intensity of the two fluors reflects the relative abundance of that 
particular mRNA in the two preparations. 

The use of two-color labeling has the important property of robustly eliminating effects 
due to variation in the amount of DNA spotted on the arrays, variation in the stringency of 
the hybridization (salt concentration, temperature, and duration of hybridization), and vari- 
ation in the local concentration of labeled sample during hybridization. Any changes in these 
parameters should affect the numerator and denominator of fluorescence ratios equally and 
thus would not influence measured fluorescence ratios. 

In studies that include many different RNA preparations, it becomes impractical to 
hybridize all pair-wise combinations of samples. The solution is to hybridize each of the 
experimental samples against the same common reference RNA preparation, and to calculate 
the level of each mRNA relative to the reference. The relative abundances of mRNAs among 
the different experimental samples are then calculated indirectly, from the ratio of ratios. 
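The "ratio of ratios" calculation described above can be sketched as follows. This is a minimal illustration, not part of the original protocol: the function names and the example fluorescence ratios are hypothetical, and the sketch assumes that each input is a sample-to-reference ratio for the same gene measured against the same reference preparation.

```python
import math


def relative_abundance(ratio_a_vs_ref, ratio_b_vs_ref):
    """Abundance of one mRNA in sample A relative to sample B.

    Each argument is a measured fluorescence ratio (sample / reference)
    for the same gene. Dividing the two ratios cancels the reference term:
    (A / R) / (B / R) = A / B.
    """
    if ratio_a_vs_ref <= 0 or ratio_b_vs_ref <= 0:
        raise ValueError("fluorescence ratios must be positive")
    return ratio_a_vs_ref / ratio_b_vs_ref


def log2_relative_abundance(ratio_a_vs_ref, ratio_b_vs_ref):
    """Same comparison on a log2 scale, where the division becomes a subtraction."""
    return math.log2(ratio_a_vs_ref) - math.log2(ratio_b_vs_ref)


# Hypothetical ratios for one gene in two tumor samples, each versus the reference.
tumor1_vs_ref = 4.0
tumor2_vs_ref = 0.5

print(relative_abundance(tumor1_vs_ref, tumor2_vs_ref))       # tumor 1 relative to tumor 2
print(log2_relative_abundance(tumor1_vs_ref, tumor2_vs_ref))  # same comparison, log2 scale
```

Working on the log2 scale is a common convention for two-color data because increases and decreases of the same magnitude become symmetric around zero.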

For many experiments, a "natural" RNA reference sample is available, for example, when 
quantifying changes in mRNA abundance over time (reference = time zero sample) or after 
treatment with a drug (reference = control unexposed to drug). However, in certain 
instances, such as in the measurement of mRNA abundances in different tumor samples, no 
natural reference exists. In such cases, an mRNA "common reference set" can be used (Eisen 
and Brown 1999). If each test (i.e., tumor) sample is compared to the same common refer- 
ence set, then the relative intensity of gene expression among the samples can be calculated. 
In this type of experiment, the common reference set becomes the denominator for all meas- 
ured fluorescence ratios. 

An ideal common reference set should hybridize with sufficient signal intensity to all or 
nearly all of the DNA elements in the array, should be easy and cost-effective to prepare, and 
should be easily reproduced to allow comparison of samples over time and between locations. 
Note that the common reference set need not have any particular biological purpose or prop- 
erty and need not include RNA from every cell type under investigation. However, a good ref- 
erence set should generate a readable signal from most elements on the array. In addition, for 
the reference set to be useful to a broad group of researchers, it should be constructed from
validated sources that are readily available. In practical terms, a common reference comprising
pooled RNA from stable lines of cultured cells meets many of these criteria. 

This protocol outlines the preparation of an RNA common reference set used extensively
by the cDNA microarray community at Stanford University (Perou et al. 1999, 2000; Ross et
al. 2000). This reference set is produced from 11 different established cell lines (see Table
3-6), many of which are available from the American Type Culture Collection. These cell lines
are easy to grow and, in combination, hybridize to a large majority of arrayed cDNA
elements. Although the optimal number of cell lines used in a reference set has not been
established experimentally, it is probably important not to use too many different cell types:
pooling RNA from many lines may dilute low-abundance transcripts that are expressed in only
one or a few of the cell lines to the point that they can no longer be detected.

Most laboratories with the infrastructure required for DNA microarrays should be able to
reproduce the Stanford common reference set. Producing a batch of the common reference
set typically requires 2-3 months of full-time work to grow the 11 cell lines and purify
sufficient RNA for several thousand array hybridizations. For some investigators, therefore,
a commercial reference set may be the best option. Stratagene Inc. sells a common reference
set of RNA extracted from ten human cell lines (Novoradovskaya et al. 2000). However,
unlike the Stanford set, few details are available about the cell lines and their provenance.

Both the Stanford and the Stratagene common reference sets allow rational comparison 
of microarray results within and between laboratories. Other possible reference sets, less 
commonly used, that may be able to fulfill the same purpose include labeled genomic DNA 
or RNA generated by transcription of a pool of cDNAs in vitro. 

WHY USE A COMMON REFERENCE SET? 

Consider an experiment whose goal is to identify genes that are expressed at different levels 
in tumors and normal cells. Two experimental designs are possible: 

• Tumor versus tumor versus normal
Samples: T1, T2, N1, N2
Options for comparison:

T1 vs. N1, T2 vs. N2, T1 vs. N2, T2 vs. N1, T1 vs. T2, N1 vs. N2
This matrix provides a comparison of all possible combinations and has the
advantage that questions are addressed by direct comparison: How does one tumor
compare with another? How does tumorous tissue compare with normal? How does
normal compare with normal? However, this approach is only practical when the
number of samples is very small, as the number of combinations increases
quadratically with the number of samples (n(n-1)/2 comparisons for n samples).

• Tumor versus reference, normal versus reference
Samples: T1, T2, N1, N2, R

Options for comparison:

T1 vs. R, T2 vs. R, N1 vs. R, N2 vs. R

Here, all samples are tested against a common reference RNA and comparison is
made indirectly with respect to the reference. The number of combinations is
manageable because it increases linearly with the number of samples. In addition,
comparisons can be made between experiments and investigators who use a common
reference. A potential disadvantage is that inferences are made indirectly between
samples, via the reference, potentially reducing the resolving power of the
experiment. The ideal common reference therefore should be widely accessible, available
in unlimited amounts, and should provide a signal over a wide range of genes.
These issues are considered in more detail in Section 7.
Figure 3-12 illustrates the differences in these two experimental designs.
FIGURE 3-12. Differences in performing all pairwise combinations vs. using a common reference.
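The difference in scale between the two designs can be made concrete with a short sketch. The function names are hypothetical and the sketch only counts hybridizations; it assumes one hybridization per comparison and ignores replicates and dye swaps. For n samples, the all-pairs design needs n(n-1)/2 hybridizations, whereas the common-reference design needs n.

```python
from itertools import combinations


def all_pairwise(samples):
    """Every direct sample-versus-sample comparison (first design)."""
    return list(combinations(samples, 2))


def versus_reference(samples, reference="R"):
    """Each sample hybridized once against the common reference (second design)."""
    return [(sample, reference) for sample in samples]


samples = ["T1", "T2", "N1", "N2"]
print(all_pairwise(samples))      # the six direct comparisons listed in the text
print(versus_reference(samples))  # the four comparisons against R

# Number of hybridizations required as the study grows.
for n in (4, 10, 50, 100):
    print(n, "samples:", n * (n - 1) // 2, "pairwise vs.", n, "against reference")
```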



MATERIALS 

CAUTION: Please see Appendix 3 for appropriate handling of materials marked with <!>. 

Buffers and Solutions 

Please see Appendix 2 for components of stock solutions, buffers, and reagents. 
Dilute stock solutions to the appropriate concentrations. 

ATRA (all-trans retinoic acid; R 2625, Sigma)

Prepare a 10 mM stock in DMSO.
Cell culture media

RPMI 1640 (GIBCO or equivalent)

10% fetal bovine serum (Hyclone or equivalent)

penicillin and streptomycin (1x concentration, GIBCO)
Phosphate-buffered saline (PBS), lacking Ca2+ and Mg2+
1x Trypsin-EDTA

Gels 

Agarose gel (1%) containing 2.2 M formaldehyde <!> 
For details, please see Sambrook and Russell (2001). 

Special Equipment 

FastTrack 2.0 mRNA isolation kit (Invitrogen) 

Humidified incubator with an atmosphere of 5% CO2

Laminar flow hood for passaging and harvesting of cells 

Roller bottles and appropriate incubator (alternative to tissue culture dishes) 

Tissue culture dishes (15 cm) 

UV spectrophotometer

TABLE 3-6. Cell Lines Used in Stanford "Common RNA Reference"

Name        Description                                              Growth properties   ATCC catalog No. or Reference
MCF7        breast adenocarcinoma-derived cell line                  adherent            ATCC HTB-22
Hs578T      breast adenocarcinoma-derived cell line (stromal-like)   adherent            ATCC HTB-126
NTERA2      teratoma-derived cell line                               adherent            ATCC CRL-1973
Colo205     colon tumor-derived cell line                            mixed               ATCC CCL-222
OVCAR-3     ovarian tumor-derived cell line                          adherent            ATCC HTB-161
UACC-62     melanoma-derived cell line                               adherent            Stinson et al. (1992)
MOLT-4      T cell leukemia-derived cell line                        suspension          ATCC CRL-1582
RPMI 8226   multiple myeloma-derived cell line                       suspension          ATCC CCL-155
NB4+ATRA    acute promyelocytic leukemia-derived cell line           suspension          Lanotte et al. (1991)
SW872       liposarcoma-derived cell line                            adherent            ATCC HTB-92
HepG2       liver tumor-derived cell line                            adherent            ATCC HB-8065

Cells and Tissues 

Cell lines 

Please see Table 3-6 for a listing and description of cell lines used in the Stanford common
reference set of RNAs. Of the 11 cell lines, 9 are currently available from the American Type
Culture Collection (www.atcc.org). However, because NB4 and UACC-62 are not at present
distributed by the ATCC, it is impossible to replicate the Stanford Common Reference Set from
publicly available sources. We therefore recommend that HL-60 cells (ATCC CCL-240) be
substituted for NB4 cells and that the SK-MEL-2 melanoma cell line (ATCC HTB-68) be
substituted for UACC-62 cells. Because the aim should be to preserve the "commonality" of the
reference set as far as possible, other substitutions by individual laboratories are discouraged.

METHOD 
Growing Cell Lines 

The 11 cell lines listed in Table 3-6 are easy to maintain and grow in either monolayer or
suspension culture. This protocol is designed for cells growing in culture dishes. If suspension
cultures are grown in flasks or roller bottles, adjust the protocol accordingly.

1. Maintain the 11 cell lines in RPMI 1640 supplemented with 10% fetal calf serum in
15-cm tissue culture dishes at 37°C in an atmosphere of 5% CO2 in air.

Cells should be passaged when approaching confluence at ~1:5 dilution using standard
trypsinization procedures for adherent lines.

Record passage numbers of cell lines in a log book or database. Store master cultures of early
passages in liquid nitrogen in small aliquots in medium containing 10% DMSO.
Discard cell lines after ten passages in cell culture and replace them from frozen master cultures.
Wherever possible, use the same batch of fetal calf serum to grow the 11 cell lines. Serum
is probably a significant source of batch-to-batch variation in reference RNAs. Record the
batch numbers of serum and medium used for each passage of each cell line.

2. Expand the cell numbers so that 10-20 15-cm tissue culture dishes of each cell line can 
be grown. 

3. Change the media 48 hours before harvesting the cells for isolation of mRNA. 

Media changes produce alterations in the pattern and intensity of gene expression. To
minimize variation between preparations of common reference RNAs, establish standardized
protocols for growing and refeeding cells and follow them faithfully.

4. Add all-trans retinoic acid (ATRA) to a final concentration of 10 µM to the culture
medium of NB4 cells (or substituted HL-60 cells) ~48 hours before harvesting the cells.

NB4 and HL-60 cells are derived from cases of human promyelocytic leukemia. Addition
of ATRA induces, over the course of 48 hours, the expression of genes specific to
granulocytic differentiation with concomitant maturation of the cells.

5. Harvest the cells for mRNA isolation at ~80% confluence.

Patterns of gene expression change as cell cultures approach confluence. To minimize
variation among preparations of common reference RNAs, standardize protocols for growing
and harvesting cells and follow them faithfully.

Preparation of mRNA 

6. Prepare mRNA from the cell lines using, for example, the FastTrack 2.0 (Invitrogen) 
mRNA isolation kit. 

a. Remove the media from the culture by aspiration. 

b. Lyse the cells on the surface of the dishes using the lysis buffer supplied by the manu- 
facturer. 

c. Isolate the mRNA by batch chromatography on oligo(dT)-cellulose. 

A number of methods can be used to prepare either total RNA or mRNA from cell lines. 
The Stanford group routinely uses the FastTrack 2.0 (Invitrogen) mRNA isolation kit. 
The FastTrack kit yields mRNA of high quality for labeling and hybridization to DNA 
microarrays, although less expensive alternatives are likely to be equally effective. 
An outline of the procedure is given here; for details please see Protocol 6. 

7. Determine the yield and purity of mRNA by UV spectrophotometry or staining with 
RiboGreen (please see the information panel on QUANTIFICATION OF RNA). 

8. Assess the quality and integrity of the mRNA by denaturing gel electrophoresis through
formaldehyde-agarose gels (please see information panel on MAMMALIAN, PLANT, AND
BACTERIAL RNAs).

9. If the quantity and quality of the isolated mRNA are satisfactory, combine equal amounts
(weight) of mRNA from each cell line and mix well. Distribute the pooled RNA mixture
into small aliquots (10-20 µg), and store them at -80°C or as described in the
information panel on STORAGE AND RECOVERY OF RNA.
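Combining equal weights of mRNA in step 9 means converting each preparation's measured concentration into a volume. The sketch below shows that arithmetic only. The function name, the cell-line subset, and the concentrations are hypothetical values chosen for illustration, not measurements from the protocol.

```python
def pooling_volumes(concentrations_ug_per_ul, mass_per_line_ug):
    """Volume (in µl) of each mRNA preparation that contains the same mass.

    concentrations_ug_per_ul maps a cell-line name to its measured mRNA
    concentration in µg/µl; mass_per_line_ug is the mass each line should
    contribute to the pool.
    """
    volumes = {}
    for name, concentration in concentrations_ug_per_ul.items():
        if concentration <= 0:
            raise ValueError(f"concentration for {name} must be positive")
        volumes[name] = mass_per_line_ug / concentration
    return volumes


# Hypothetical concentrations for three of the cell lines (µg/µl).
measured = {"MCF7": 1.0, "HepG2": 0.5, "MOLT-4": 2.0}

volumes = pooling_volumes(measured, mass_per_line_ug=10.0)
for name, volume in volumes.items():
    print(f"{name}: pipette {volume:.1f} µl to contribute 10 µg")

total_mass = 10.0 * len(measured)
print(f"Pool contains {total_mass:.0f} µg in {sum(volumes.values()):.1f} µl")
```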



Calibrating Different Batches of Common Reference RNA

Although every effort is taken to grow the cell lines and isolate the mRNA in a uniform and standardized
manner, even slight differences in the treatment of cells, or in the quantification of mRNA, may change the
abundances of mRNA species in the reference set. To correlate results obtained from experiments using
different batches of the reference sets, it may be necessary to quantify and correct for batch-to-batch
variation. This correction may be achieved by labeling different batches of reference sets with Cy3 and Cy5
and hybridizing them in pair-wise crosses to DNA microarrays. After this experiment has been repeated
several times, accurate batch-specific "calibration" coefficients can be calculated for each gene on the array.
These coefficients may be used to correct for systematic differences caused by batch-to-batch variation in
the common reference sets.
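One way such per-gene calibration coefficients might be computed is sketched below. The text does not specify a formula, so this is an assumption: the coefficient for each gene is taken as the mean log2 ratio of the new batch to the old batch across replicate hybridizations, and it is added to measurements made against the new batch. The function names, gene labels, and ratios are hypothetical, and the sketch assumes all replicate ratios have already been oriented as new/old (for example, after accounting for dye swaps).

```python
import math
from statistics import mean


def calibration_coefficients(replicate_ratios):
    """Per-gene coefficient: mean log2(new batch / old batch) over replicates.

    replicate_ratios maps a gene name to a list of fluorescence ratios
    (new batch / old batch) from repeated batch-versus-batch hybridizations.
    """
    return {
        gene: mean(math.log2(ratio) for ratio in ratios)
        for gene, ratios in replicate_ratios.items()
    }


def to_old_batch_scale(log2_sample_vs_new, coefficients):
    """Re-express log2(sample / new batch) as log2(sample / old batch).

    log2(sample / old) = log2(sample / new) + log2(new / old)
    """
    return {
        gene: value + coefficients[gene]
        for gene, value in log2_sample_vs_new.items()
    }


# Hypothetical batch-versus-batch ratios for two genes, three replicates each.
batch_crosses = {"GENE_A": [2.0, 2.0, 2.0], "GENE_B": [1.0, 1.0, 1.0]}
coefficients = calibration_coefficients(batch_crosses)

# Hypothetical tumor measurements made against the new batch (log2 scale).
tumor_vs_new = {"GENE_A": 0.0, "GENE_B": 1.5}
print(to_old_batch_scale(tumor_vs_new, coefficients))
```

Averaging on the log scale treats a twofold excess and a twofold deficit symmetrically, which is why the sketch does not average the raw ratios.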
Each year, over 182,000 women in the United States are
diagnosed with breast cancer, and approximately 45,000 die
of the disease. 1 Incidence appears to be increasing in the
United States at a rate of roughly 2% per year. The reasons
for the increase are unclear, but non-genetic risk factors appear
to play a large role. 2

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease-free
survival. The gene has been shown to be amplified and/or 
overexpressed in 10%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which
HER-2/neu status can be evaluated: immunohistochemistry
(IHC, HercepTest™) and FISH (fluorescent in situ
hybridization, PathVysion™ Kit). Both methods can be performed on
archived and current specimens. The first method allows visual
assessment of the amount of HER-2/neu protein present on
the cell membrane. The latter method allows direct
quantification of the level of gene amplification present in the tumor,
enabling differentiation between low- versus high-amplification.
At least one study has demonstrated a difference in
recurrence risk in women younger than 40 years of age for
low- versus high-amplified tumors (54.5% compared to
85.7%); this is compared to a recurrence rate of 16.7% for
patients with no HER-2/neu gene amplification. 4 HER-2/neu
status may be particularly important to establish in women with
small (< 1 cm) tumor size.

The choice of methodology for determination of HER-2/neu
status depends in part on the clinical setting. FDA approval
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fluorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a subset of women who,
because they did not benefit from more aggressive treatment,
did not need to be exposed to the associated side effects. In
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptin® (Trastuzumab)
monoclonal antibody therapy, however, is based upon
demonstration of HER-2/neu protein overexpression using HercepTest™.
Studies using Herceptin® in patients with metastatic breast
cancer show an increase in time to disease progression,
increased response rate to chemotherapeutic agents and a small
increase in overall survival rate. The FISH assays have not yet
been approved for this purpose, and studies looking at response
to Herceptin® in patients with or without gene amplification
status determined by FISH are in progress.

In general, FISH and IHC results correlate well. However,
subsets of tumors are found which show discordant results;
i.e., protein overexpression without gene amplification or lack
of protein overexpression with gene amplification. The
clinical significance of such results is unclear. Based on the above
considerations, HER-2/neu testing at SHMC/PAML will
utilize immunohistochemistry (HercepTest™) as a screen,
followed by FISH in IHC-negative cases. Alternatively, either
method may be ordered individually depending on the
clinical setting or clinician preference.
HER-2/neu via IHC

88342 (including interpretive report) 

HER-2/neu via FISH 

88271 x2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
88291 Cytogenetics and molecular cytogenetics, interpre- 
tation and report 



Procedural Information 

Immunohistochemistry is performed using the FDA-approved
DAKO antibody kit, HercepTest™. The DAKO kit contains
reagents required to complete a two-step immunohistochemical
staining procedure for routinely processed,
paraffin-embedded specimens. Following incubation with the primary
rabbit antibody to human HER-2/neu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion of the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site. The specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
results. 

FISH analysis at SHMC/PAML is performed using the
FDA-approved PathVysion™ HER-2/neu DNA probe kit,
produced by Vysis, Inc. Formalin-fixed, paraffin-embedded breast
tissue is processed using routine histological methods, and then
slides are treated to allow hybridization of DNA probes to the
nuclei present in the tissue section. The PathVysion™ kit
contains two direct-labeled DNA probes, one specific for the
alphoid repetitive DNA (CEP 17, spectrum orange) present at
the chromosome 17 centromere and the second for the
HER-2/neu oncogene located at 17q11.2-12 (spectrum green).
Enumeration of the probes allows a ratio of the number of copies
of HER-2/neu to the number of copies of chromosome 17 to
be obtained; this enables quantification of low versus high
amplification levels, and allows an estimate of the percentage
of cells with HER-2/neu gene amplification. The clinically
relevant distinction is whether the increase in gene copy number
is due to amplification of the gene on the two chromosome 17
homologues normally present or to an increase in the number of
copies of chromosome 17 in the cells. In the majority of cases,
ratios less than 2.0 are indicative of a normal/negative
result; ratios of 2.1 and over indicate that amplification is
present and to what degree. Interpretation of these data will be
performed and reported from the Vysis-certified
Cytogenetics laboratory at SHMC.
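The ratio-based interpretation described above can be illustrated with a short sketch. It is an illustration only and not a clinical tool. The function names and signal counts are hypothetical; the 2.0 and 2.1 cutoffs are taken from the text, and because the text does not say how values between them are reported, the "borderline" label is an assumption.

```python
def her2_cep17_ratio(her2_signals, cep17_signals):
    """Ratio of total HER-2/neu signals to total chromosome 17 centromere
    (CEP 17) signals counted over the same set of nuclei."""
    if cep17_signals <= 0:
        raise ValueError("at least one CEP 17 signal is required")
    return her2_signals / cep17_signals


def classify(ratio):
    """Apply the cutoffs quoted in the text: below 2.0 is normal/negative,
    2.1 and over indicates amplification. Values in between are labeled
    'borderline' here, which is an assumption of this sketch."""
    if ratio < 2.0:
        return "not amplified"
    if ratio >= 2.1:
        return "amplified"
    return "borderline"


# Hypothetical counts over 60 nuclei.
cases = {
    "normal": (130, 120),         # about two copies of each per nucleus
    "amplified": (300, 120),      # extra HER-2/neu copies, normal chromosome 17 count
    "extra chr 17": (240, 240),   # four copies of both: more gene copies, ratio unchanged
}
for label, (her2, cep17) in cases.items():
    ratio = her2_cep17_ratio(her2, cep17)
    print(f"{label}: ratio {ratio:.2f} -> {classify(ratio)}")
```

The third case shows why the ratio is used rather than the raw HER-2/neu count: extra copies of the whole chromosome raise both counts together and leave the ratio near 1.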
Aneuploidy and cancer

Subrata Sen, PhD 



Numeric aberrations in chromosomes, referred to as
aneuploidy, are commonly observed in human cancer. Whether
aneuploidy is a cause or consequence of cancer has long been
debated. Three lines of evidence now make a compelling case
for aneuploidy being a discrete chromosome mutation event
that contributes to malignant transformation and progression
process. First, precise assay of chromosome aneuploidy in
several primary tumors with in situ hybridization and
comparative genomic hybridization techniques has revealed that
specific chromosome aneusomies correlate with distinct tumor
phenotypes. Second, aneuploid tumor cell lines and in vitro
transformed rodent cells have been reported to display an
elevated rate of chromosome instability, thereby indicating that
aneuploidy is a dynamic chromosome mutation event
associated with transformation of cells. Third, and most important, a
number of mitotic genes regulating chromosome segregation
have been found mutated in human cancer cells, implicating
such mutations in induction of aneuploidy in tumors. Some of
these gene mutations, possibly allowing unequal segregations
of chromosomes, also cause tumorigenic transformation of
cells in vitro. In this review, the recent publications
investigating aneuploidy in human cancers, rate of chromosome
instability in aneuploid tumor cells, and genes implicated in
regulating chromosome segregation found mutated in cancer cells
are discussed. Curr Opin Oncol 2000, 12:82-88 © 2000 Lippincott
Williams & Wilkins, Inc.

Cancer research over the past decade has firmly
established that malignant cells accumulate a large number of
genetic mutations that affect differentiation,
proliferation, and cell death processes. In addition, it is also
recognized that most cancers are clonal, although they
display extensive heterogeneity with respect to
karyotypes and phenotypes of individual clonal populations.
It is estimated that numeric chromosomal imbalance,
referred to as aneuploidy, is the most prevalent genetic
change recorded among over 20,000 solid tumors
analyzed thus far [1]. Phenotypic diversity of the clonal
populations in individual tumors involves differences in
morphology, proliferative properties, antigen expression,
drug sensitivity, and metastatic potentials. It has been
proposed that an underlying acquired genetic instability
is responsible for the multiple mutations detected in
cancer cells that lead to tumor heterogeneity and
progression [2]. In a somewhat contradictory argument,
it has also been suggested that clonal expansion due to
selection of cells undergoing normal rates of mutation
can explain malignant transformation and progression
process in humans [3]. Acquired genetic instability,
nonetheless, is considered important for more rapid
progression of the disease [4]. Although the original
hypothesis on genetic instability in cancer primarily
focused on chromosome imbalances in the form of
aneuploidy in tumor cells, the actual relevance of such
mutations in cancer remains a controversial issue.

Whether or not aneuploidy contributes to the malignant 
transformation and progression process has long been 
debated. A prevalent idea on genetics of cancer referred 
to as "somatic gene mutation hypothesis" contends that 
gene mutations at the nucleotide level alone can cause 
cancer by either activating cellular proto-oncogenes to
dominant cancer causing oncogenes and/or by inactivat- 
ing growth inhibitory tumor suppressor genes. In this 
scheme of things chromosomal instability in the form of 
aneuploidy is a mere consequence rather than a cause of 
malignant transformation and progression process. 

In this review, some of the recent observations on the
subject are discussed and compelling evidence is
provided to suggest that aneuploidy is a distinct form of
genetic instability in cancer that frequently correlates
with specific phenotypes and stages of the disease.
Furthermore, discrete genetic targets affecting
chromosomal stability in cancer cells, recently identified, are
also discussed. These data provide a new direction
toward elucidating the molecular mechanisms
responsible for induction of aneuploidy in cancer and may
eventually be exploited as novel therapeutic targets in the
future.

Genetic alterations in cancer 

Alterations in many genetic loci regulating growth,
senescence, and apoptosis, identified in tumor cells,
have led to the current understanding of cancer as a
genetic disease. The genetic changes identified in
tumors include: subtle mutations in genes at the
nucleotide level; chromosomal translocations leading to
structural rearrangements in genes; and numeric
changes in either partial segments of chromosomes or
whole chromosomes (aneuploidy) causing imbalance in
gene dosage.

For the purpose of this review, both segmental and whole 
chromosome imbalances leading to altered DNA dosage 
in cancer cells are included as examples of aneuploidy.

Incidence of aneuploidy in cancer

Evidence of aneuploidy involving one or more chromo- 
somes have been commonly reported in human tumors. 
Although these observations were initially made using 
classic cytogenetic techniques late in a tumor's evolu- 
tion and were difficult to correlate with cancer progres- 
sion, more recent studies have reported association of 
specific nonrandom chromosome aneuploidy with
different biologic properties such as loss of hormone 
dependence and metastatic potential [5]. 

Classic cytogenetic studies performed on tumor cells
had serious limitations in scope because they were
applicable only to those cases in which mitotic
chromosomes could be obtained. Because of low spontaneous
rates of cell division in primary tumors, analyses
depended on cells either derived selectively from
advanced metastases or those grown in vitro for variable
periods of time. In both instances, metaphases analyzed
represented only a subset of primary tumor cell
population. Two major advances in cytogenetic analytic
techniques, in situ hybridization (ISH) and comparative
genomic hybridization (CGH), have allowed better
resolution of chromosomal aberrations in freshly isolated
tumor cells [6]. ISH analyses with chromosome-specific
DNA probes, a powerful adjunct to metaphasic analysis,
allow assessment of chromosomal anomalies within
tumor cell populations in the contexts of whole nuclear
architecture and tissue organization. CGH allows
genome-wide screening of chromosomal anomalies
without the use of specific probes even in the absence
of prior knowledge of chromosomes involved. Although
both techniques have certain limitations in terms of
their resolution power, they nonetheless provide a
better approximation of chromosomal changes occurring
among tumors of various histology, grade, and stage
compared with what was possible with the classic
cytogenetic techniques. Genomic ploidy measurements
have also been performed at the DNA level with flow
cytometry and cytofluorometric methods. Although
these assays underestimate chromosome ploidy due to a
chromosomal gain occasionally masking a chromosomal
loss in the same cell, several studies using these
methods have supported the conclusion that DNA
aneuploidy closely associates with poor prognosis in
various cancers [7,8]. This discussion of some recent
examples published on aneuploidy in cancer includes
discussion of studies dealing with DNA ploidy
measurements as well. Most of these observations are correlative
without direct proof of specific involvement of genes on
the respective chromosomes. Identification of putative
oncogenes and tumor suppressor genes on gained and
lost chromosomes in aneuploid tumors, however, is
providing strong evidence that chromosomes involved in
aneuploidy play a critical role in the tumorigenic
process.

In renal tumors, either segmental or whole chromosome 
aneuploidy appears to be uniquely associated with 
specific histologic subtypes [9]. Tumors from patients
with hereditary papillary renal carcinomas (HPRC)
commonly show trisomy of chromosome 7 when
analyzed by CGH. Germline mutations of a putative
oncogene, MET, have been detected in patients with
HPRC. A recent study [10] has demonstrated that an
extra copy of chromosome 7 results in nonrandom duplication
of the mutant MET allele in HPRC, thereby
implicating this trisomy in tumorigenesis. The study
suggested that mutation of MET may render the cells
more susceptible to errors in chromosome replication,
and that clonal expansion of cells harboring duplicated
chromosome 7 reflects their proliferative advantage. In
addition to chromosome 7, trisomy of chromosome 17 in
papillary tumors and also of chromosome 8 in mesoblastic
nephroma are commonly seen. Association of specific
chromosome imbalances with benign and malignant
forms of papillary renal tumors, therefore, not only
contributes to an understanding of tumor origins and
evolution, but also implicates aneuploidy of the respective
chromosomes in the tumorigenic transformation
process.

In colorectal tumors, chromosome aneuploidy is a
common occurrence. In fact, molecular allelotyping
studies have suggested that limited karyotyping data
available from these tumors actually underestimate the
true extent of these changes. Losses of heterozygosity
reflecting loss of the maternal or paternal allele in
tumors are widespread and often accompanied by a gain
of the opposite allele. Therefore, for example, a tumor
could lose a maternal chromosome while duplicating
the same paternal chromosome, leaving the tumor cell
with a normal karyotype and ploidy but an aberrant
allelotype. It has been estimated that cancer of the
colon, breast, pancreas, or prostate may lose an average
of 25% of its alleles. It is not unusual to discover that a
tumor has lost over half of its alleles [4]. In clinical
settings, DNA ploidy measurements have revealed that
DNA aneuploidy indicates high risk of developing
severe premalignant changes in patients with ulcerative
colitis, who are known to have an increased risk of
developing colorectal cancer [11]. DNA aneuploidy has
been found to be one of the useful indicators of lymph
node metastasis in patients with gastric carcinoma and
associated with poor outcome compared with diploid
cases [12,13]. CGH analysis of chromosome aneuploidy,
on the other hand, was reported to correlate gain
of chromosome 20q with high tumor S phase fractions
and loss of 4q with low tumor apoptotic indices [14].
Aneuploidy of chromosome 4 in metastatic colorectal
cancer has recently been confirmed in studies that used
unbiased DNA fingerprinting with arbitrarily primed
polymerase chain reactions to detect moderate gains
and losses of specific chromosomal DNA sequences
[15]. The molecular karyotype (amplotype) generated
from colorectal cancer revealed that moderate gains of
sequences from chromosomes 8 and 13 occurred in
most tumors, suggesting that overrepresentation of
these chromosomal regions is a critical step for metastatic
colorectal cancer.

In addition to being implicated in tumorigenesis and 
correlated with distinct tumor phenotypes, chromosome 
aneuploidy has been used as a marker of risk assessment 
and prognosis in several other cancers. The potential 
value of aneuploidy as a noninvasive tool to identify 
individuals at high risk of developing head and neck 
cancer appears especially promising. Interphase fluorescence
in situ hybridization (FISH) revealed extensive
aneuploidy in tumors from patients with head and neck
squamous cell carcinomas (HNSCC) and also in clinically
normal distant oral regions from the same individuals
[16,17]. It has been proposed that a panel of chromosome
probes in FISH analyses may serve as an
important tool to detect subclinical tumorigenesis and
for diagnosis of residual disease. The presence of aneuploid
or tetraploid populations is seen in 90% to 95% of
esophageal adenocarcinomas, and when seen in
conjunction with Barrett's esophagus, a premalignant
condition, predicts progression of disease [18,19].
Chromosome ploidy analyses in conjunction with loss of
heterozygosity and gene mutation studies in Barrett's
esophagus reflect evolution of neoplastic cell lineages in
vivo [20]. Evolution of neoplastic progeny from Barrett's
esophagus following somatic genetic mutations
frequently involves bifurcations and loss of heterozygosity
at several chromosomal loci leading to aneuploidy
and cancer. Accordingly, it is hypothesized that during
tumor cell evolution diploid cell progenitors with
somatic genetic abnormalities undergo expansion with
acquired genetic instability. Such instability, often
manifested in the form of increased incidence of aneuploidy,
enters a phase of clonal evolution beginning in
premalignant cells that proceeds over a period of time
and occasionally leads to malignant transformation. The
clonal evolution continues even after the emergence of
cancer.

The significance of DNA and chromosome aneuploidy
in other human cancers continues to be evaluated.
Among papillary thyroid carcinomas, aneuploid
DNA content in tumor cells was reported to correlate
with distant metastases, reflecting worsened prognosis
[21]. Genome wide screening of follicular thyroid
tumors by CGH, on the other hand, revealed frequent
loss of chromosome 22 in widely invasive follicular
carcinomas [22]. Chromosome copy number gains in
invasive neoplasm compared with foci of ductal carcinoma
in situ (DCIS) with similar histology have been
proposed to indicate involvement of aneuploidy in
progression of human breast cancer [23]. ISH analysis
of cervical intraepithelial neoplasia has provided
suggestive evidence that chromosome 1, 7, and X
aneusomy is associated with progression toward cervical
carcinoma [24].

Although the prognostic value of numeric aberrations 
remains a matter of debate in human hematopoietic 
neoplasia, there have been recent studies to suggest that 
the presence of monosomy 7 defines a distinct subgroup 
of acute myeloid leukemia patients [25]. It is interesting
in this context that therapy-related myelodysplasia
syndromes have been reported to display monosomy 5
and 7 karyotypes, reflecting poor prognosis [26].

The clinical observations, mentioned previously, are
supported by in vitro studies in human and rodent cells in
which aneuploidy is induced at early stages of transformation
[27,28]. It is even suggested that aneuploidy may
cause cell immortalization, in some instances, that is a
critical step preceding transformation.

Finally, in an interesting study to develop transgenic 
mouse models of human chromosomal diseases, chromo- 
some segment specific duplication and deletions of the 
genome were reported to be constructed in mouse 
embryonic stem cells [29]. Three duplications for a 
portion of mouse chromosome 11 syntenic with human 
chromosome 17 were established in the mouse 
germ line. Mice with a 1-Mb duplication developed corneal
hyperplasia and thymic tumors. The findings represent
the first transgenic mouse model of aneuploidy of a
defined chromosome segment that documents the direct
role of chromosome aneusomy in tumorigenesis.

Aneuploidy as "dynamic cancer-causing mutation"
instead of a "consequential state" in cancer

According to the hypothesis previously discussed, aneuploidy
represents either a "gain of function" or "loss of
function" mutation at the chromosome level with a
causative influence on the tumorigenesis process. The
hypothesis, however, is based only on circumstantial
evidence even though existence of aneuploidy is correlated
with different tumor phenotypes. The existence of
numeric chromosomal alterations in a tumor does not
mean that the change arose as a dynamic mutation due
to genomic instability, because several factors could also
lead to consequential aneuploidy in tumors. Although
aneuploidy as a dynamic mutation due to genomic instability
in tumor cells would occur at a certain measurable
rate per cell generation, a consequential state of aneuploidy
in tumors may not occur at a predictable rate
under similar conditions or in tumors with similar
phenotypes. In addition to genomic instability, differences
in environmental factors with selective pressure
could explain high incidence of aneuploidy and other
somatic mutations in tumors compared with normal cells
[4]. These include humoral, cell substratum, and cell-cell
interaction differences between tumor and normal
cell environments. It could be argued that despite
similar rates of spontaneous aneuploidy induction in
normal and tumor cells, the latter are selected to proliferate
due to altered selective pressure in the tumor cell
environment, whereas the normal cells are eliminated
through activation of apoptosis. Alternatively, of course,
one could postulate that selective expression or overexpression
of anti-apoptotic proteins or inactivation of
proapoptotic proteins in tumor cells may counteract
default induction of apoptosis in G2/M phase cells
undergoing missegregation of chromosomes. Recent
demonstration of overexpression of a G2/M phase anti-apoptotic
protein, survivin, in cancer cells [30] suggests
that this protein may favor aberrant progression of aneuploid
transformed cells through mitosis. This would
then lead to proliferation of aneuploid cell lineages,
which may undergo clonal evolution.

To ascertain that aneuploidy is a dynamic mutational
event, various human tumor cell lines and transformed
rodent cell lines have been analyzed for the rate of
aneuploidy induction. Growth under controlled in
vitro conditions ensures that environmental
factors do not influence selective proliferation of
cells with chromosome instability. In one study,
Lengauer et al. [31•] provided unequivocal evidence by
FISH analyses that losses or gains of multiple chromosomes
occurred in excess of 10^-2 per chromosome per
generation in aneuploid colorectal cancer cell lines. The
study further concluded that such chromosomal instability
appeared to be a dominant trait. Using another in
vitro model system of Chinese hamster embryo (CHE)
cells, Duesberg et al. [32•] have also obtained similar
results. With clonal cultures of CHE cells, transformed
with nongenotoxic chemicals and a mitotic inhibitor,
these authors demonstrated that the overwhelming
majority of the transformed colonies contained more
than 50% aneuploid cells, indicating that aneuploidy
would have originated from the same cells that underwent
transformation. All the transformed colonies tested
were tumorigenic. It was further documented that the
ploidy factor, representing the quotient of the modal
chromosome number divided by the normal diploid
number in each clone, correlated directly with the
degree of chromosomal instability. Therefore, chromosomal
instability was found proportional to the degree of
aneuploidy in the transformed cells and the authors
hypothesized that aneuploidy is a unique mechanism of
simultaneously altering and destabilizing, in a massive
manner, the normal cellular phenotypes. In the absence
of any evidence that the transforming chemicals used in
the study did not induce other somatic mutations, it is
difficult to rule out the contribution of such mutations
in the transformation process. These results nonetheless
make a strong case for aneuploidy being a dynamic chromosome
mutation event intimately associated with
cancer.
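
The ploidy factor described above (modal chromosome number divided by the normal diploid number) can be illustrated with a short calculation. The following is only a minimal sketch: the function names and the per-cell chromosome counts are hypothetical and are not taken from the cited study. The diploid number of 22 is the standard value for the Chinese hamster.

```python
from collections import Counter

def ploidy_factor(chromosome_counts, diploid_number):
    """Modal chromosome number of a clone divided by the normal diploid number."""
    # The mode is the chromosome count seen in the largest number of cells.
    modal_number = Counter(chromosome_counts).most_common(1)[0][0]
    return modal_number / diploid_number

def aneuploid_fraction(chromosome_counts, diploid_number):
    """Fraction of scored cells whose chromosome count differs from diploid."""
    aneuploid = sum(1 for n in chromosome_counts if n != diploid_number)
    return aneuploid / len(chromosome_counts)

# Hypothetical metaphase counts for one transformed clone (illustrative only).
CHE_DIPLOID = 22  # Chinese hamster, 2n = 22
clone = [44, 44, 43, 45, 44, 22, 44]

print(ploidy_factor(clone, CHE_DIPLOID))       # modal 44 / 22
print(aneuploid_fraction(clone, CHE_DIPLOID))  # 6 of 7 cells are not diploid
```

A ploidy factor near 1 indicates a near-diploid clone, whereas values well above 1 indicate a clone whose modal chromosome number is far from the diploid complement.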

Aneuploidy versus somatic gene mutation in
cancer

The idea that numeric chromosome imbalance or aneuploidy
is a direct cause of cancer was proposed at the
turn of the century by Theodore Boveri [33]. However,
the hypothesis was largely ignored over the last several
decades in favor of the somatic gene mutation hypothesis,
mentioned earlier. Evidence accumulating in the
literature lately on specific chromosome aneusomies
recognized in primary tumors, incidence of aneuploidy
in cells undergoing transformation, and aneuploid tumor
cells showing a high rate of chromosome instability has
led to the rejuvenation of Boveri's hypothesis. The
concept has recently been discussed as a "vintage wine
in a new bottle" [34•]. The author points out that
except for rare cancers caused by dominant retroviral
oncogenes, diploidy does not seem to occur in solid
tumors, whereas aneuploidy is a rule rather than exception
in cancer.

Aneuploidy as an effective mutagenic mechanism
driving tumor progression, on the other hand, is being
recognized as a viable solution to the paradox that with
known mutation rate in non-germline cells (~10^-7 per
gene per cell generation) tumor cell lineages cannot
accumulate enough mutant genes during a human lifetime
[35]. The concept is gaining significant credibility
since genes that potentially affect chromosome segregation
were found mutated in human cancer. Some of
these genes have also been shown to have transforming
capability in in vitro assays. Selected recent publications
describing the findings are discussed below in
reference to the mitotic targets potentially involved in
inducing chromosome segregation anomalies in cells.

Potential mitotic targets and molecular 
mechanisms of aneuploidy 

Because aneuploidy represents numeric imbalance in 
chromosomes, it is reasonable to expect that aneuploidy 
arises due to missegregation of chromosomes during cell
division. There are many potential mitotic targets,
which could cause unequal segregation of chromosomes
(Fig. 1). Recent investigations have identified several
genes involved in regulating these mitotic targets and 
mitotic checkpoint functions, which can be implicated 
in induction of aneuploidy in tumor cells. This discus- 
sion is restricted to those mitotic targets and checkpoint 
genes whose abnormal functioning has been observed in 
cancer or has been shown to cause tumorigenic transfor- 
mation of cells, in recent years. The role of telomeres is 
discussed elsewhere in this issue. For a more detailed 
description of the components of mitotic machinery and 
their possible involvement in causing chromosome 
segregation abnormalities in tumor cells, readers may 
refer to a recently published review [36•].

Among the mitotic targets implicated in cancer, centrosome
defects have been observed in a wide variety of
malignant human tumors. Centrosomes play a central role
in organizing the microtubule network in interphase cells
and mitotic spindle during cell division. Multipolar
mitotic spindles have been observed in human cancers in
situ and abnormalities in the form of supernumerary
centrosomes, centrosomes of aberrant size and shape as
well as aberrant phosphorylation of centrosome proteins
have been reported in prostate, colon, brain, and breast
tumors [37,38]. In view of the findings that abnormal
centrosomes retain the ability to nucleate microtubules in
vitro, it is conceivable that cells with abnormal centrosomes
may missegregate chromosomes producing aneuploid
cells. The molecular and genetic bases of abnormal
centrosome generation and the precise pathway through
which they regulate the chromosome segregation process
remain to be elucidated. Recent discovery of a
centrosome-associated kinase STK15/BTAK/aurora2, naturally
amplified and overexpressed in human cancers, has raised
the interesting possibility that aberrant expression of this
kinase is critically involved in abnormal centrosome function
and unequal chromosome segregation in tumor cells
[39,40]. Exogenous expression of the kinase in rodent and
human cells was found to correlate with an abnormal
number of centrosomes, unequal partitioning of chromosomes
during division, and tumorigenic transformation of
cells. It is relevant in this context to mention that the
Xenopus homologue of human STK15/BTAK/aurora2
kinase has recently been shown to phosphorylate a microtubule
motor protein XlEg5, the human orthologue of
which is known to participate in the centrosome separation
during mitosis [41]. Findings on STK15/aurora2
kinase, thus, provide an interesting lead to a possible
molecular mechanism of the centrosome's role in oncogenesis.
Centrosomes have, of late, been implicated in oncogenesis
from studies revealing supernumerary centrosomes
in p53-deficient fibroblasts and overexpression of
another centrosome kinase PLK1 being detected in
human non-small cell lung cancer [42].

One of the critical events that ensures equal partition- 
ing of the chromosomes during mitosis is the proper 
and timely separation of sister chromatids that are
attached to each other and to the mitotic spindle.
Untimely separation of sister chromatids has been
suspected as a cause of aneuploidy in human tumors.
Cohesion between sister chromatids is established
during replication of chromosomes and is retained until
the next metaphase/anaphase transition. It has been
shown that during metaphase-anaphase transition, the
anaphase promoting complex/cyclosome triggers the
degradation of a group of proteins called securins that
inhibit sister chromatid separation. A vertebrate securin
(v-securin) has recently been identified that inhibits
sister chromatid separation and is involved in transformation
and tumorigenesis. Subsequent analysis
revealed that the human securin is identical to the
product of the gene called pituitary tumor transforming
gene, which is overexpressed in some tumors and
exhibits transforming activity in NIH3T3 cells. It is
proposed that elevated expression of the v-securin may
contribute to generation of malignant tumors due to
chromosome gain or loss produced by errors in chromatid
separation [43•].

Normal progression through mitosis during prophase to
anaphase transition is monitored at least at two checkpoints.
One checkpoint operates during early prophase
at G2 to metaphase progression while the second
ensures proper segregation of chromosomes during
metaphase to anaphase transition. Several mitotic
checkpoint genes responding to mitotic spindle defects
have been identified in yeast. The metaphase-anaphase
transition is delayed following activation of this checkpoint
during which kinetochores remain unattached to
the spindle. The signal is transmitted through a kinetochore
protein complex consisting of Mps1p and several
Mad and Bub proteins [44]. It is expected that for
unequal chromosome segregation to be perpetuated
through cell proliferation cycles giving rise to aneuploidy,
checkpoint controls have to be abrogated.
Following this logic, Vogelstein et al. [45•] hypothesized
that aneuploid tumors would reveal mutation in mitotic
spindle checkpoint genes. Subsequent studies by these
investigators have proven the validity of this hypothesis
and a small fraction of human colorectal cancers have
revealed the presence of mutations in either hBUB1 or
hBUBR1 checkpoint genes. It was further revealed that
mutant BUB1 could function in a dominant negative
manner conferring an abnormal spindle checkpoint
when expressed exogenously. Inactivation of spindle
checkpoint function in virally induced leukemia has also
recently been documented following the finding that
hMAD1 checkpoint protein is targeted by the Tax
protein of the human T-cell leukemia virus type 1.
Abrogation of hMAD1 function leads to multinucleation
and aneuploidy [46].

In addition to mitotic spindle checkpoint defects, failed
DNA damage checkpoint function in yeast is frequently
associated with aberrant chromosome segregation as
well. It, therefore, appears intriguing yet relevant that
the human BRCA1 gene, proposed to be involved in
DNA damage checkpoint function, when mutated by a
targeted deletion of exon 11 led to defective G2/M cell
cycle checkpoint function and genetic instability in
mouse embryonic fibroblasts [47]. The cells revealed
multiple functional centrosomes, unequal chromosome
segregation, and aneuploidy. Although the molecular
basis for these abnormalities is not known at this
time, it raises the interesting possibility that such an
aneuploidy-driven mechanism may be involved in
tumorigenesis in individuals carrying germline mutations
of the BRCA1 gene.

Conclusion

Growing evidence from human tumor cytogenetic investigations
strongly suggests that aneuploidy is associated
with the development of tumor phenotypes. Clinical
findings of correlation between aneuploidy and tumorigenesis
are supported by studies with in vitro grown
transformed cell lines. Molecular genetic analyses of
tumor cells provide credible evidence that mutations in
genes controlling chromosome segregation during
mitosis play a critical role in causing chromosome instability
leading to aneuploidy in cancer. Further elucidation
of molecular and physiologic bases of chromosome
instability and aneuploidy induction could lead to the
development of new therapeutic approaches for
common forms of cancer.

ABSTRACT Wnt family members are critical to many
developmental processes, and components of the Wnt signaling
pathway have been linked to tumorigenesis in familial and
sporadic colon carcinomas. Here we report the identification
of two genes, WISP-1 and WISP-2, that are up-regulated in the
mouse mammary epithelial cell line C57MG transformed by
Wnt-1, but not by Wnt-4. Together with a third related gene,
WISP-3, these proteins define a subfamily of the connective
tissue growth factor family. Two distinct systems demonstrated
WISP induction to be associated with the expression of
Wnt-1. These included (i) C57MG cells infected with a Wnt-1
retroviral vector or expressing Wnt-1 under the control of a
tetracycline repressible promoter, and (ii) Wnt-1 transgenic
mice. The WISP-1 gene was localized to human chromosome
8q24.1-8q24.3. WISP-1 genomic DNA was amplified in colon
cancer cell lines and in human colon tumors and its RNA
overexpressed (2- to >30-fold) in 84% of the tumors examined
compared with patient-matched normal mucosa. WISP-3
mapped to chromosome 6q22-6q23 and also was overexpressed
(4- to >40-fold) in 63% of the colon tumors analyzed.
In contrast, WISP-2 mapped to human chromosome 20q12-20q13
and its DNA was amplified, but RNA expression was
reduced (2- to >30-fold) in 79% of the tumors. These results
suggest that the WISP genes may be downstream of Wnt-1
signaling and that aberrant levels of WISP expression in colon
cancer may play a role in colon tumorigenesis.



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3β (GSK-3β), resulting in an increase in
β-catenin levels. Stabilized β-catenin interacts with the transcription
factor TCF/Lef1, forming a complex that appears in
the nucleus and binds TCF/Lef1 target DNA elements to
activate transcription (7, 8). Other experiments suggest that
the adenomatous polyposis coli (APC) tumor suppressor gene
also plays an important role in Wnt signaling by regulating
β-catenin levels (9). APC is phosphorylated by GSK-3β, binds
to β-catenin, and facilitates its degradation. Mutations in
either APC or β-catenin have been associated with colon
carcinomas and melanomas, suggesting these mutations contribute
to the development of these types of cancer, implicating
the Wnt pathway in tumorigenesis (1).

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of
the transforming growth factor (TGF)-β superfamily, and the
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signaling
pathway that are relevant to the transformed cell phenotype,
we used a PCR-based cDNA subtraction strategy, suppression
subtractive hybridization (SSH) (11), using RNA
isolated from C57MG mouse mammary epithelial cells and
C57MG cells stably transformed by a Wnt-1 retrovirus. Overexpression
of Wnt-1 in this cell line is sufficient to induce a
partially transformed phenotype, characterized by elongated
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1
and WISP-2, and a third related gene, WISP-3. The WISP genes
are members of the CCN family of growth factors, which
includes connective tissue growth factor (CTGF), Cyr61, and
nov, a family not previously linked to Wnt signaling.

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA
Subtraction Kit (CLONTECH). Tester double-stranded
cDNA was synthesized from 2 μg of poly(A)+ RNA isolated
from the C57MG/Wnt-1 cell line and driver cDNA from 2 μg
of poly(A)+ RNA from the parent C57MG cells. The subtracted
cDNA library was subcloned into a pGEM-T vector for
further analysis.

Abbreviations: TGF, transforming growth factor; CTGF, connective
tissue growth factor; SSH, suppression subtractive hybridization;
VWC, von Willebrand factor type C module.

Data deposition: The sequences reported in this paper have been
deposited in the GenBank database (accession nos. AF100777,
AF100778, AF100779, AF100780, and AF100781).

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a λgt10 mouse
embryo cDNA library (CLONTECH) with a 70-bp probe from
the original partial clone 568 sequence corresponding to amino
acids 128-169. Clones encoding full-length human WISP-1
were isolated by screening λgt10 lung and fetal kidney cDNA
libraries with the same probe at low stringency. Clones encoding
full-length mouse and human WISP-2 were isolated by
screening a C57MG/Wnt-1 or human fetal lung cDNA library
with a probe corresponding to nucleotides 1463-1512. Full-length
cDNAs encoding WISP-3 were cloned from human
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 μM of each dNTP at
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles.
WISP and glyceraldehyde-3-phosphate dehydrogenase primer
sequences are available on request. 

In Situ Hybridization. 33P-labeled sense and antisense riboprobes
were transcribed from an 897-bp PCR product corresponding
to nucleotides 601-1440 of mouse WISP-1 or a
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation fluorimetry. Total RNA was prepared
by homogenization in 7 M GuSCN followed by centrifugation
over CsCl cushions or prepared by using RNAzol.

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and
fluorogenic probes (sequences available on request) were
designed and used to amplify and quantitate the genes. The
relative gene copy number was derived by using the formula
2^(ΔCt), where ΔCt represents the difference in amplification
cycles required to detect the WISP genes in peripheral blood
lymphocyte DNA compared with colon tumor DNA or colon
tumor RNA compared with normal mucosal RNA. The
δ-method was used for calculation of the SE of the gene copy
number or RNA expression level. The WISP-specific signal was
normalized to that of the glyceraldehyde-3-phosphate dehydrogenase
housekeeping gene. All TaqMan assay reagents
were obtained from Perkin-Elmer Applied Biosystems. 
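
The 2^(ΔCt) calculation described above can be sketched in a few lines. This is an illustrative sketch only, not the authors' analysis code: the function name, the threshold-cycle values, and the way the housekeeping-gene correction is applied (subtracting the GAPDH cycle difference from the WISP cycle difference) are assumptions, and the formula presumes that the PCR product doubles every cycle.

```python
def relative_level(ct_target_reference, ct_target_sample,
                   ct_gapdh_reference=None, ct_gapdh_sample=None):
    """Fold difference of a target in a sample versus a reference, as 2**dCt.

    dCt is the number of extra cycles the reference needs to reach the
    detection threshold compared with the sample. If GAPDH threshold cycles
    are supplied, the GAPDH cycle difference is subtracted first so that
    unequal template input does not masquerade as a change in the target
    (an assumed normalization scheme).
    """
    delta_ct = ct_target_reference - ct_target_sample
    if ct_gapdh_reference is not None and ct_gapdh_sample is not None:
        delta_ct -= ct_gapdh_reference - ct_gapdh_sample
    # Assumes perfect doubling of product per amplification cycle.
    return 2.0 ** delta_ct

# Hypothetical threshold cycles: the tumor is detected 2 cycles earlier
# than normal lymphocyte DNA, which corresponds to a 4-fold difference.
print(relative_level(27.0, 25.0))
# With a 1-cycle GAPDH difference in the same direction, the corrected
# difference is 1 cycle, or 2-fold.
print(relative_level(27.0, 25.0, 20.0, 19.0))
```

Under this scheme a negative ΔCt gives a value below 1, which would correspond to the reduced RNA expression reported for WISP-2.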

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify
Wnt-1-inducible genes, we used the technique of SSH using the
mouse mammary epithelial cell line C57MG and C57MG cells
that stably express Wnt-1 (11). Candidate differentially expressed
cDNAs (1384 total) were sequenced. Thirty-nine
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but
not in the parent C57MG cells or C57MG cells overexpressing
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce
the morphological transformation of C57MG cells and has no
effect on β-catenin levels (13, 14). Expression of WISP-1 was
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ~40,000 (Mr 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 2A). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ~27,000 (Mr 27 K) (Fig. 2B). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 

Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A)+ RNA (2 μg) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP-1-specific probe 
(amino acids 278-300) or a 190-bp WISP-2-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human β-actin probe. 

Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1. 

Identification of WISP-3. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3A). 
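
Percent identity figures such as those above can be computed from a pairwise alignment roughly as follows. This is a sketch only: the function name, the toy sequences, and the convention of counting identities over aligned, non-gap columns are assumptions, since the text does not state which convention or alignment program was used.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns in which both sequences carry a residue.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if a != gap and b != gap]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy aligned fragments (hypothetical, not taken from the WISP sequences).
    print(percent_identity("GCGCCKVC", "GCQCCKVC"))
```

Other conventions (for example, dividing by the length of the shorter sequence or by the full alignment length) give somewhat different values, so reported percentages are only comparable when the same convention is used.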

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elm1 gene. Elm1 is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-β (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 

Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed locally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 

Fig. 4. (A, C, E, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and B), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal (s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20q12-20q13.1. Human WISP-3 mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (21, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISP-3 was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 

Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 μg) 
digested with EcoRI (WISP-1) or XbaI (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 

Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 

Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 

mucosa. The amount of overexpression of WISP-3 ranged from 
4- to >40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracycline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., β-catenin-TCF-1/Lef1). The increased levels of 
WISP RNA were measured in Wnt-1-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through β-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-β, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin αvβ3 serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-β1, which is the stimulus for 
stromal proliferation (34). TGF-β1 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20q12-20q13, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20q13 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and β-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic β-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 

We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS). 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 
W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K. W. Kinzler, Cell 
88:243-251, 1997). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delineate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient. 
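
The kind of comparison summarized above can be illustrated with a short sketch that computes a Pearson correlation coefficient between paired mRNA and protein abundances, and the fold range of protein levels among genes with the same mRNA level. The function names and all numbers are hypothetical and are not data from the study; the choice of the Pearson coefficient is likewise an assumption made for illustration.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def fold_range(values):
    """Ratio of the largest to the smallest (strictly positive) value."""
    if min(values) <= 0:
        raise ValueError("values must be positive")
    return max(values) / min(values)


if __name__ == "__main__":
    # Hypothetical copies per cell for five genes.
    mrna = [1.0, 1.0, 2.0, 5.0, 10.0]
    protein = [1000.0, 20000.0, 5000.0, 8000.0, 30000.0]
    print(pearson_r(mrna, protein))
    # Two genes with the same mRNA level but different protein levels.
    print(fold_range(protein[:2]))
```

A single correlation coefficient can be dominated by a few highly abundant species, so it is usually worth inspecting subsets of the data (for example, low-abundance messages only) as well as the full set.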



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sential but largely unexplored area of biology. With recent 
technical advances including the development of differential 
display-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for the quantitative description of biological 
systems. This evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the half-lives of specific proteins or mRNAs (33), and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, see reference 11) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
(isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 
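
To make the idea of a sequence-specific fragment pattern concrete, the sketch below computes the theoretical singly charged b- and y-ion m/z values for a peptide from monoisotopic residue masses. It is an illustration only: the function and constant names are assumptions, the masses are standard textbook values rounded to five decimals, and this is not the scoring procedure used by the database-search software cited above, which compares acquired and predicted spectra by cross-correlation.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565   # H2O
PROTON = 1.007276   # charge carrier


def fragment_ions(peptide):
    """Return (b_ions, y_ions): singly charged m/z lists for b1.. and y1.."""
    masses = [MONO[residue] for residue in peptide]
    b_ions, y_ions = [], []
    running = 0.0
    # b ions grow from the N terminus; the full-length b ion is omitted.
    for mass in masses[:-1]:
        running += mass
        b_ions.append(running + PROTON)
    running = 0.0
    # y ions grow from the C terminus and retain the terminal water.
    for mass in reversed(masses[1:]):
        running += mass
        y_ions.append(running + WATER + PROTON)
    return b_ions, y_ions


if __name__ == "__main__":
    b_series, y_series = fragment_ions("PEPTIDE")
    for index, (b, y) in enumerate(zip(b_series, y_series), start=1):
        print(f"b{index} = {b:.4f}   y{index} = {y:.4f}")
```

Because leucine and isoleucine have identical masses, fragment patterns of this kind cannot distinguish them, which is one reason identification relies on matching against a sequence database rather than on de novo reading alone.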

In this study, we have analyzed the mRNA and protein levels 
of a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by microspray liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the same met- 
abolic state. The resultant correlation is insufficient for predic- 

FIG. 1. Schematic illustration of proteome analysis by 2DE and mass spectrometry. In part I, proteins are separated by 2DE, stained spots are excised and subjected 
to in-gel digestion with trypsin, and the resulting peptides are separated by on-line capillary high-performance liquid chromatography. In part II, a peptide is shown 
eluting from the column in part I. The peptide is ionized by electrospray ionization and enters the mass spectrometer. The mass of the ionized peptide is detected, and 
the first quadrupole mass filter allows only the specific mass-to-charge ratio of the selected peptide ion to pass into the collision cell. In the collision cell, the energized, 
ionized peptides collide with neutral argon gas molecules. Fragmentation of the peptide is essentially random but occurs mainly at the peptide bonds, resulting in smaller 
peptides of differing lengths (masses). These peptide fragments are detected as a tandem mass (MS/MS) spectrum in the third quadrupole mass filter where two ion 
series are recorded simultaneously, one each from sequencing inward from the N and C termini of the peptide, respectively. In part III, the MS/MS spectrum from the 
selected, ionized peptide is compared to predicted tandem mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the database, the peptide and, by association, the protein from which the peptide was derived can be identified. Unambiguous protein identification is attained in a single 
analysis because multiple peptides are identified as being derived from the same protein. 



tion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein expression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that therefore the 
construction of complete proteome maps with current technol- 
ogy will be very challenging, irrespective of the type of organ- 
ism. 

MATERIALS AND METHODS 

Yeast strain and growth conditions. The source of protein and message tran- 
scripts for all experiments was YPH499 (MATa ura3-52 lys2-801 ade2-101 
leu2-Δ1 his3-Δ200 trp1-Δ63) (30). Logarithmically growing cells were obtained by 
growing yeast cells to early log phase (3 × 10^6 cells/ml) in YPD rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tryptophan) 
at 30°C (35). Metabolic labeling of protein was accomplished in YPD medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to offset methionine present in YPD medium. Protein was 
harvested as described by Garrels and coworkers (12). Harvested protein was 
lyophilized, resuspended in isoelectric focusing gel rehydration solution, and 
stored at -80°C. 

2DE. Soluble proteins were run in the first dimension by using a commercial 
flatbed electrophoresis system (Multiphor II; Pharmacia Biotech). Immobilized 
polyacrylamide gel (IPG) dry strips with nonlinear pH 3.0 to 10.0 gradients 
(Amersham-Pharmacia Biotech) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell lysates was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-40, 10 mM dithiothreitol), and 250 
to 380 μl of solution was added to individual lanes of an IPG strip rehydration 
tray (Amersham-Pharmacia Biotech). The strips were allowed to rehydrate at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 h, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run (60 to 70 
kV·h), the IPG strips were reequilibrated for 8 min in 2% (wt/vol) dithiothreitol in 
2% (wt/vol) SDS-6 M urea-30% (wt/vol) glycerol-0.05 M Tris HCl (pH 6.8) and 
for 4 min in 2.5% iodoacetamide in 2% (wt/vol) SDS-6 M urea-30% (wt/vol) 
glycerol-0.05 M Tris HCl (pH 6.8). Following reequilibration, the strips were 
transferred and apposed to 10% polyacrylamide second-dimension gels. Poly- 
acrylamide gels were poured in a casting stand with 10% acrylamide-2.67% 
piperazine diacrylamide-0.375 M Tris base-HCl (pH 8.8)-0.1% (wt/vol) SDS-0.05% 

FIG. 2. 2D silver-stained gel of the proteins in yeast total cell lysate. Proteins were separated in the first dimension (horizontal) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the entire range of molecular weights, isoelectric focusing points, 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometry and database searching. The spots are labeled on the 
gel and correspond to the data presented in Table 1. Molecular weights are given in thousands. 



(wt/vol) ammonium persuJfate-0.05% TEMED (W^V'/Z'-tetranieihyletbyl- 
encdiamiae) hi Milli-Q water. The appatutus used to nm second-dimension gels 
was a n on co miner did apparatus from Oxford Glycoscieaces, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run ;it 
SO mA (constaut)~500 V-H5 W for 20 inin, followed by 21)0 inA (const ant )-50fJ 
V-85 W until the buffer front line was 10 to 15 mm from the bottom of the gel. 
Gels were removed and silver stained according to the procedure of Shevcheuko 
ct aL (29). 
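
As a rough consistency check on the first-dimension focusing program, the
integrated volt-hours can be estimated stage by stage. The sketch below is only
illustrative: it assumes that the ramp from 300 V to 3,500 V is linear (the
ramp shape is not stated), and the function name volt_hours is hypothetical.

```python
def volt_hours(hold_hours):
    """Approximate kV*h for the first-dimension focusing program.

    Assumption: the 300 V -> 3,500 V ramp is linear, so its contribution
    is the mean voltage multiplied by the ramp duration.
    """
    stage1 = 300 * 2                 # 300 V held for 2 h
    stage2 = (300 + 3500) / 2 * 3    # linear ramp over 3 h (assumed shape)
    stage3 = 3500 * hold_hours       # final hold at 3,500 V
    return (stage1 + stage2 + stage3) / 1000.0


# The final hold is stated as 15 to 19 h.
low, high = volt_hours(15), volt_hours(19)
print(f"{low:.1f} to {high:.1f} kV*h")
```

Under these assumptions the program integrates to roughly 59 to 73 kV·h, which
brackets the reported 60 to 70 kV·h.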

Protein identification. Gels were exposed to X-ray film overnight, and then the
silver staining and film were used to excise 156 spots of varying intensities,
molecular weights, and isoelectric focusing points. In order to increase the
detection limit by mass spectrometry, spots were cut out and pooled from up to
four identical cold, silver-stained gels. In-gel tryptic digests of pooled
spots were performed as described previously (29). Tryptic peptides were
analyzed by microcapillary LC-MS with automated switching to MS/MS mode for
peptide fragmentation. Spectra were searched against the composite OWL protein
sequence database (version 30.2; 250,514 protein sequences) (24a) by using the
computer program Sequest (8), which matches theoretical and acquired tandem
mass spectra. A protein match was determined by comparing the number of
peptides identified and their respective cross-correlation scores. All protein
identifications were verified by comparison with theoretical molecular weights
and isoelectric points.



mRNA quantitation. Velculescu and coworkers have previously generated
frequency tables for yeast mRNA transcripts from the same strain grown under
the same stated conditions as described herein (35). The SAGE technology is
based on two main principles. First, a short sequence tag (15 bp) that contains
sufficient information uniquely to identify a transcript is generated. A single
tag is usually generated from each mRNA transcript in the cell which
corresponds to 15 bp at the 3'-most cutting site for NlaIII. Second, many
transcript tags can be concatenated into a single molecule and then sequenced,
revealing the identity of multiple tags simultaneously. Over 20,000 transcripts
were sequenced from yeast strain YPH499 growing at mid-log phase on glucose.
Assuming the previously derived estimate of 15,000 mRNA molecules per cell
(16), this would represent a 1.3-fold coverage even for mRNA molecules present
at a single copy per cell and would provide a 72% probability of detecting such
transcripts. Computer software which took for input the gene detected, examined
the nucleotide sequence, and performed the calculation as described by
Velculescu and coworkers (35) was written. In practice, we found that for 21 of
128 (16%) genes examined reliable mRNA levels from SAGE data could not be
calculated. This was because (i) no CATG site was found in the open reading
frame (ORF), (ii) a CATG site was found but the corresponding 10-bp putative
SAGE tag was not found in the frequency tables, or (iii) identical putative
SAGE tags were present for multiple genes (e.g., TDH2_YEAST and TDH3_YEAST).
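
The tag lookup and its three failure cases can be sketched as follows. This is
a minimal illustration and not the authors' software: the function names, the
toy ORF sequences, and the tag counts are all hypothetical, and the sketch
simply takes the 10 bases that follow the 3'-most CATG site in the ORF.

```python
def putative_sage_tag(orf, tag_len=10):
    """Return the tag_len bases after the 3'-most CATG site, or None."""
    seq = orf.upper()
    site = seq.rfind("CATG")
    if site == -1:
        return None                       # case (i): no CATG site in the ORF
    tag = seq[site + 4: site + 4 + tag_len]
    return tag if len(tag) == tag_len else None


def mrna_level(gene, orfs, tag_counts):
    """Return the tag count for a gene, or None when no level can be assigned."""
    tag = putative_sage_tag(orfs[gene])
    if tag is None or tag not in tag_counts:
        return None                       # cases (i) and (ii)
    sharing = [g for g, seq in orfs.items() if putative_sage_tag(seq) == tag]
    if len(sharing) > 1:
        return None                       # case (iii): tag shared by several genes
    return tag_counts[tag]


# Toy data, invented for illustration only.
orfs = {
    "geneA": "ATGCATGAAACCCGGGTTTAA",
    "geneB": "ATGAAATTTGGGCCCTAA",            # contains no CATG site
    "geneC": "ATGCATG" + "T" * 10 + "AA",     # shares its tag with geneD
    "geneD": "ATGGGCATG" + "T" * 10 + "AA",
}
tag_counts = {"AAACCCGGGT": 12, "TTTTTTTTTT": 40}

for gene in orfs:
    print(gene, mrna_level(gene, orfs, tag_counts))
```

Only geneA yields a usable value here; geneB fails for lack of a CATG site and
geneC and geneD fail because their putative tags are identical, which mirrors
the TDH2/TDH3 situation described above.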
TABLE 1. Expressed genes identified from 2D gel in Fig. 2

Mol wt      pI       Spot  YPD gene  Protein    mRNA       Codon
                     no.   name(a)   abundance  abundance  bias
                                     (10^3      (copies/
                                     copies/    cell)
                                     cell)

17,259      6.75     133   CPR1      15.2       61.7       0.769
18,?02      4.80     83    EGD2      20.1       5.2        0.724
18,726      4.44     147   YKL056C   61.2       88.4       0.831
18,978      5.95     135   YER067W   3.7        6.7        0.118
19,108      5.04     130   YLR109W   94.4       9.7        0.680
19,681      9.08     136   ATP7      11.0       NA(c)      0.246
20,505      6.07     111   GUK1      16.5       3.7        0.422
21,444      5.25     148   SAR1      5.4        10.4       0.455
21,583      4.98     95    TSA1      110.6      40.1       0.845
22,602      4.30     80    EFB1      66.1       23.8       0.875
23,079      6.29     112   SOD2      12.6       2.2        0.351
23,743      5.44     137   HSP26     NA(d)      0.7        0.434
24,033      5.97     96    ADK1      17.4       16.4       0.656
24,058      4.43     143   YKL117W   29.2       10.4       0.339
24,353      6.30     140   TFS1      8.1        0.7        0.146
24,662      5.85     99    URA5      25.4       6.0        0.359
24,808      6.33     97    GSP1      26.3       5.2        0.735
24,908      8.73     122   RPS5      18.6       NA(c)      0.899
25,081      4.65     81    MRP8      9.3        NA(c)      0.241
25,960      6.06     116   RPE1      5.8        0.7        0.372
26,378      9.55     127   RPS3      96.8       NA(c)      0.863
26,?67      5.18     100   VMA4      10.5       3.7        0.427
26,661      5.84     98    TPI1      NA(d)      NA(c)      0.900
27,?56      5.56     93    PRE8      6.9        0.7        0.129
27,?34      6.13     115   YHR049W   18.4       2.2        0.520
27,472      5.33     92    YNL010W   31.6       3.7        0.421
27,480      8.95     123   GPM1      10.0       169.4      0.902
27,480      8.95     124   GPM1      231.4      169.4      0.902
27,480      8.95     125   GPM1      7.5        169.4      0.902
27,809      5.97     139   HOR2      5.7        0.7        0.381
27,874      4.46     78    YST1      13.6       52.8       0.805
28,595      4.51     41    PUP2      4.4        0.7        0.147
29,156      6.59     114   YMR226C   14.5       2.2        0.283
29,244      8.40     120   DPM1      5.0        11.2       0.362
29,443      5.91     48    PRE4      3.4        3.7        0.162
30,012      6.39     138   PRB1      21.2       1.5        0.449
30,073      4.63     77    BMH1      14.7       28.2       0.454
30,296      7.94     121   OMP2      67.4       41.6       0.499
30,435      6.34     89    GPP1      70.2       11.2       0.703
31,332      5.57     88    ILV6      13.9       3.0        0.402
32,159      5.46     113   IPP1      63.1       3.7        0.752
32,263      6.00     149   [?]       22.4       4.5        0.232
33,311      5.35     84    SPE3      15.1       6.7        0.468
34,465      5.60     129   ADE1      8.7        5.2        0.305
34,762      5.32     85    SEC14     10.9       6.0        0.373
34,797      5.85     42    URA1      49.5       8.9        0.237
34,799      6.04     90    BEL1      103.2      81.0       0.875
35,556      5.97     43    YDL124W   6.4        4.5        0.206
35,619      8.41     59    TDH1      69.8       32.7(c)    0.940
35,650      5.49     68    CAR1      5.2        3.0        0.339
35,712      6.72     117   TDH2      49.6       473.0(c)   0.982
35,712      6.72     154   TDH2      863.5      473.0(c)   0.982
35,712      6.72     155   TDH2      [?]        [?]        [?]
36,272      4.85     128   APA1      8.7        0.7        0.425
36,358      5.05     75    YJR105W   17.6       17.1       0.522
36,358      5.05     76    YJR105W   27.5       17.1       0.522
36,596      6.37     79    ADH2      58.9       260.0(c)   0.711
36,714      6.30     102   ADH1      746.1      260.0(c)   0.913
36,714      6.30     103   ADH1      17.6       260.0      0.913
36,714      6.30     104   ADH1      61.4       260.0      0.913
36,714      6.30     105   ADH1      52.7       260.0      0.913
37,033      6.23     44    TAL1      44.8       3.7        0.701
37,796      7.36     57    IDH2      29.4       6.7        0.330
37,886      6.49     106   ILV5      76.0       4.5        0.892
38,700      7.83     55    BAT1      30.9       11.2       0.469
38,702      6.24     46    QCR2      NA(d)      2.2        0.326



TABLE 1 (continued)

Mol wt      pI       Spot  YPD gene  Protein    mRNA       Codon
                     no.   name(a)   abundance  abundance  bias
                                     (10^3      (copies/
                                     copies/    cell)
                                     cell)

39,477      5.58     86    FBA1      17.8       183.6      0.935
39,477      5.58     87    FBA1      427.2      183.6      0.935
39,540      6.50     150   HOM2      60.3       4.5        0.592
39,561      6.12     156   PSA1      96.4       27.5       0.718
41,158      6.01     49    YNL134C   14.9       1.5        0.316
41,623      7.18     58    BAT2      19.0       3.9        0.250
41,728      7.29     110   ERG10     24.1       4.5        0.543
41,900      5.42     74    TOM40     22.3       2.2        0.375
42,402      6.29     45    CYS3      6.7        8.9        0.621
42,883      5.63     67    DYS1      15.8       5.2        0.526
43,409      6.31     107   SER1      10.5       1.5        0.292
43,421      5.59     91    ERG6      2.2        14.1       0.405
44,174      7.32     56    YBR025C   13.1       6.0        0.684
44,682      4.99     72    TIF1      2.9        39.4       0.834
44,707      7.77     108   PGK1      23.7       165.7      0.897
44,707      7.77     109   PGK1      315.2      165.7      0.897
46,080      6.72     30    CAR2      15.4       NA(c)      0.495
46,383      8.52     53    IDP1      7.7        0.7        0.436
46,553      5.98     47    IDP2      32.4       NA(c)      0.197
46,679      6.39     50    ENO1      35.4       0.7        0.930
46,679      6.39     51    ENO1      6.6        0.7        0.930
46,679      6.39     52    ENO1      2.2        0.7        0.930
46,773      5.82     63    ENO2      15.5       289.1      0.960
46,773      5.82     64    ENO2      635.5      289.1      0.960
46,773      5.82     65    ENO2      93.0       289.1      0.960
46,773      5.82     66    ENO2      31.0       289.1      0.960
47,402      6.09     126   COR1      2.5        0.7        0.422
47,666      8.98     54    AAT2      11.7       6.0        0.338
48,364      5.25     73    WTM1      74.5       13.4       0.365
48,530      6.20     61    MET17     38.1       29.0       0.576
48,904      5.18     69    LYS9      16.2       3.7        0.463
48,987      4.90     153   SUP45     29.6       11.9       0.377
49,727      5.47     70    PRO2      13.6       5.2        0.297
49,912      9.27     62    TEF2      558.5      282.0      0.932
50,444      5.67     35    YDR190C   4.8        2.2        0.228
50,837      6.11     32    YEL047C   3.8        1.5        0.387
50,891      4.59     151   TUB2      11.2       7.4        0.404
51,547      6.80     27    LPD1      18.9       2.2        0.351
52,216      7.25     29    SHM2      19.7       7.4        0.722
52,859      5.54     37    YFR044C   30.2       6.7        0.442
53,798      5.19     71    HXK2      26.5       7.4        0.756
53,803      6.05     145   GYP6      4.4        0.7        0.147
54,403      5.29     39    ALD6      37.7       2.2        0.664
54,403      5.29     40    ALD6      6.6        2.2        0.664
54,502      6.20     31    ADE13     6.3        1.5        0.417
54,543      7.75     25    PYK1      225.3      101.8      0.965
54,543      7.75     26    PYK1      39.8       101.8      0.965
55,221      6.66     146   YEL071W   16.3       3.0        0.244
55,295      4.35     134   PDI1      66.2       14.1       0.589
55,364      5.98     24    GLK1      22.6       6.0        0.237
55,481      7.97     118   ATP1      21.6       2.2        0.637
55,886      6.47     28    CYS4      22.2       NA(c)      0.444
56,167      5.83     33    ARO8      14.3       3.0        0.324
56,167      5.83     34    ARO8      9.1        3.0        0.324
56,584      6.36     20    CYB2      18.9       NA(c)      0.259
57,366      5.53     60    FRS2      2.3        0.7        0.451
57,383      5.98     144   ZWF1      5.6        0.7        0.215
57,464      5.?9     36    THR4      ?1.4       3.7        0.50?
57,512      [?]      7     SRV2      6.5        NA(c)      0.260
57,727      4.92     152   VMA2      33.7       8.9        0.546
58,573      6.47     17    ACH1      4.4        1.5        0.327
58,573      6.47     18    ACH1      5.4        1.5        0.327
61,353      5.87     21    PDC1      6.5        200.7      0.962
61,353      5.87     22    PDC1      303.2      200.7      0.962
61,353      5.87     23    PDC1      16.3       200.7      0.962
61,649      5.54     38    [?]       2.2        1.5        0.271



Continued 
TABLE 1 (continued)

Mol wt      pI       Spot  YPD gene  Protein    mRNA       Codon
                     no.   name(a)   abundance  abundance  bias
                                     (10^3      (copies/
                                     copies/    cell)
                                     cell)

61,902      6.21     101   PDC5      4.3        NA(c)      0.828
62,266      6.19     16    ICL1      20.1       NA(c)      0.327
62,862      8.02     19    ILV3      5.3        4.5        0.548
63,082      6.40     119   PGM2      2.2        3.0        0.402
64,335      5.77     5     PAB1      30.4       1.5        0.616
66,120      5.42     8     STI1      6.7        0.7        0.313
66,120      5.42     9     STI1      6.4        0.7        0.313
66,450      5.29     141   SSB2      7.0        NA(c)      0.880
66,450      5.29     142   SSB2      2.3        NA(c)      0.880
66,456      5.23     10    SSB1      64.5       79.5       0.907
66,456      5.23     11    SSB1      59.0       79.5       0.907
66,456      5.23     12    SSB1      13.7       79.5       0.907
68,397      5.82     82    LEU4      3.1        3.0        0.407
69,313      4.90     13    SSA2      24.3       18.6       0.892
69,313      4.90     14    SSA2      77.1       18.6       0.892
74,378      8.46     15    YKL029C   2.8        3.7        0.353
75,396      5.82     6     GRS1      5.5        7.4        0.500
85,720      6.25     1     MET6      2.0        NA(c)      0.772
85,720      6.25     2     MET6      10.9       NA(c)      0.772
85,720      6.25     3     MET6      1.4        NA(c)      0.772
93,276      6.11     131   EFT1      17.9       41.6       0.890
93,276      6.11     132   EFT1      5.7        41.6       0.890
102,064(e)  6.61(e)  94    ADE3      4.8        5.2        0.423
107,482(e)  5.33(e)  4     MCM3      2.7        NA(c)      0.240

(a) YPD gene names are available from the YPD website (39).
(b) NA, calculation could not be performed or was not available.
(c) mRNA data inconclusive or NA.
(d) No methionines in predicted ORF; therefore, protein concentration was not
determined.
(e) Measured molecular weight or pI did not match theoretical molecular weight
or pI.



Protein quantitation. [35S]methionine-labeled gels were exposed to X-ray film
overnight, and then the silver stain and film were used to excise 156 spots of
varying intensities, molecular weights, and pIs. The excised spots were placed
in 0.6-ml microcentrifuge tubes, and scintillation cocktail (100 µl) was added.
The samples were vortexed and counted. In addition, two parallel gels were
electroblotted to polyvinylidene difluoride membranes. The membranes were
exposed to X-ray film, and four intense single spots were excised from each
membrane and subjected to amino acid analysis. For these four spots, a mean of
209 ± 4 cpm/pmol of protein/methionine was found. This number was used to
quantitate all remaining spots in conjunction with the number of methionines
present in the protein.
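
The conversion from counted radioactivity to protein amount can be sketched as
below. Only the calibration of 209 cpm per pmol of protein per methionine comes
from the text; the function names and the number of cells represented on the
gel (cells_on_gel) are hypothetical placeholders, since the cell count is not
given in this section.

```python
AVOGADRO = 6.022e23            # molecules per mole
CPM_PER_PMOL_PER_MET = 209.0   # calibration reported in the text


def pmol_protein(cpm, n_met):
    """Convert spot radioactivity (cpm) to pmol of protein."""
    if n_met <= 0:
        # Proteins without methionine carry no label and cannot be quantitated.
        raise ValueError("protein contains no methionine")
    return cpm / (CPM_PER_PMOL_PER_MET * n_met)


def copies_per_cell(cpm, n_met, cells_on_gel):
    """Scale pmol of protein to molecules per cell (cells_on_gel is assumed)."""
    molecules = pmol_protein(cpm, n_met) * 1e-12 * AVOGADRO
    return molecules / cells_on_gel


# Invented example: a spot counting 2,090 cpm for a protein with 5 methionines.
print(pmol_protein(2090, 5))
print(copies_per_cell(2090, 5, cells_on_gel=1e7))
```

The explicit check for a methionine count of zero corresponds to the Table 1
entries for which protein concentration was not determined.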

To ensure that proteins were labeled to equilibrium, parallel 2D gels were
prepared and run on yeast metabolically labeled for 1, 2, 6, or 18 h. The
corresponding 156 spots were excised from each gel, and radioactivity was
measured by liquid scintillation counting for each spot. Calculated protein
levels were highly reproducible for all time points measured after 1 h.

Calculation of codon bias and predicted half-life. Codon bias values were
extracted from the YPD spreadsheet (17). Protein half-lives were calculated
based on the N-end rule (33). When the N-terminal processing was not known
experimentally, it was predicted based on the affinity of methionine
aminopeptidase (31).

RESULTS 

Characteristics of proteome approach. Nearly every facet of proteome analysis
hinges on the unambiguous identification of large numbers of expressed proteins
in cells. Several techniques have been described previously for the
identification of proteins separated by 2DE, including N-terminal and internal
sequencing (1, 2), amino acid analysis (38), and more recently mass
spectrometry (25). We utilized techniques based on mass spectrometry because
they afford the highest levels of sensitivity and provide unambiguous
identification. The specific procedure used is schematically illustrated in
Fig. 1 and is based on three principles. First, proteins are removed from the
gel by proteolytic in-gel digestion, and the resulting peptides are separated
by on-line capillary high-performance liquid chromatography. Second, the
eluting peptides are ionized and detected, and the specific peptide ions are
selected and fragmented by the mass spectrometer. To achieve this, the mass
spectrometer switches between the MS mode (for peptide mass identification) and
the MS/MS mode (for peptide characterization and sequencing). Selected peptides
are fragmented by a process called collision-induced dissociation (CID) to
generate a tandem mass spectrum (MS/MS spectrum) that contains the peptide
sequence information. Third, individual CID mass spectra are then compared by
computer algorithms to predicted spectra from a sequence database. This results
in the identification of the peptide and, by association, the protein(s) in the
spot. Unambiguous protein identification is attained in a single analysis by
the detection of multiple peptides derived from the same protein.

Protein identification. Yeast total cell protein lysate (40 µg), metabolically
labeled with [35S]methionine, was electrophoretically separated by isoelectric
focusing in the first dimension and by SDS-10% polyacrylamide gel
electrophoresis in the second dimension. Proteins were visualized by silver
staining and by autoradiography. Of the more than 1,000 proteins visible by
silver staining, 156 spots were excised from the gel and subjected to in-gel
tryptic digestion, and the resulting peptides were analyzed and identified by
microspray LC-MS/MS techniques as described above. The proteins in this study
were all identified automatically by computer software with no human
interpretation of mass spectra. They are indicated in Fig. 2 and detailed in
Table 1.

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which
migrated to spot 114 (molecular weight, 29,156; pI, 6.59) in the
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 
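
As a plausibility check on the two precursor ions, the m/z expected for each
tryptic peptide can be computed from standard monoisotopic residue masses. The
sketch below assumes doubly protonated peptides with no modifications (the
charge state is an assumption, not stated in the text); the function name and
the restricted mass table are illustrative only.

```python
# Monoisotopic residue masses (Da) for the residues present in the two peptides.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "V": 99.06841,
    "T": 101.04768, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "F": 147.06841, "R": 156.10111,
}
WATER = 18.01056    # added once per peptide for the free termini
PROTON = 1.00728


def precursor_mz(peptide, charge=2):
    """Monoisotopic m/z of an unmodified peptide carrying `charge` protons."""
    neutral = sum(RESIDUE_MASS[aa] for aa in peptide) + WATER
    return (neutral + charge * PROTON) / charge


for seq in ("NSGDIVNLGSIAGR", "FAVGAFTDSLR"):
    print(seq, round(precursor_mz(seq), 1))
```

The computed values (about 686.9 and 592.3) lie within roughly 0.4 m/z units of
the ions selected by the first quadrupole (687.2 and 592.6), which is
consistent with doubly charged precursors measured at unit resolution.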

Protein and mRNA quantitation. For the 156 genes investigated, the protein
expression levels ranged from 2,200 (PGM2) to 863,000 (TDH2/TDH3) copies/cell.
The levels of mRNA for each of the genes identified were calculated from SAGE
frequency tables (35). These tables contain the mRNA levels for 4,665 genes in
yeast strain YPH499 grown to mid-log phase in YPD medium on glucose as a carbon
source. In some instances, the mRNA levels could not be calculated for reasons
stated in Materials and Methods. For the proteins analyzed in this study, mean
transcript levels varied from 0.7 to 473 copies/cell.

FIG. 3. Tandem mass (MS/MS) spectra resulting from analysis of a single spot on
a 2D gel. The first quadrupole selected a single mass-to-charge ratio (m/z) of
687.2 (A) or 592.6 (B), while the collision cell was filled with argon gas, and
a voltage which caused the peptide to undergo fragmentation by CID was applied.
The third quadrupole scanned the mass range from 50 to 1,400 m/z. The computer
program Sequest (8) was utilized to match spectra to amino acid sequence by
database searching. Both spectra matched peptides from the same protein, S57593
(yeast hypothetical protein YMR226C). Five other peptides from the same
analysis were matched to the same protein.

Selection of the sample population for mRNA-protein expression level
correlation. The protein spots selected for identification were selected from
spots visible by silver staining in the 2D gel. An attempt was made not to
include spots where overlap with other spots was readily apparent. The number
of proteins identified was 156 (Table 1). Some proteins migrated to more than
one spot (presumably due to differential protein processing or modifications),
and protein levels from these spots were calculated by integrating the
intensities of the different spots. The 156 protein spots analyzed represented
the products of 128 different genes. Genes were excluded from the correlation
analysis only if part of the data set was missing; i.e., genes were excluded if
(i) no mRNA expression data were available for the protein or putative SAGE
tags were ambiguous, (ii) the amino acid sequence did not contain methionine,
(iii) more than a single protein was conclusively identified as migrating to
the same gel spot, or (iv) the theoretical and observed pIs and molecular
weights could not be reconciled. After these criteria were applied, the number
of genes used in the correlation analysis was 106.
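
The per-gene integration over multiple spots can be illustrated with a few rows
of Table 1. The sketch reads "integrating the intensities" as a plain sum of
the spot abundances, which is an interpretation rather than a documented
procedure; the function name is hypothetical.

```python
from collections import defaultdict

# (spot no., gene, protein abundance in 10^3 copies/cell), taken from Table 1.
spots = [
    (63, "ENO2", 15.5), (64, "ENO2", 635.5), (65, "ENO2", 93.0), (66, "ENO2", 31.0),
    (108, "PGK1", 23.7), (109, "PGK1", 315.2),
    (114, "YMR226C", 14.5),
]


def integrate_by_gene(spot_rows):
    """Sum the abundances of all spots assigned to the same gene."""
    totals = defaultdict(float)
    for _spot, gene, abundance in spot_rows:
        totals[gene] += abundance
    return dict(totals)


per_gene = integrate_by_gene(spots)
for gene, total in per_gene.items():
    print(gene, round(total, 1))
```

Collapsing spots in this way is what reduces the 156 analyzed spots to the
products of 128 genes before the exclusion criteria are applied.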



1726 GYGI ET AL. MOL. CELL. BIOL.

Codon bias and predicted half-lives. Codon bias is thought to be an indicator
of protein expression, with highly expressed proteins having large codon bias
values. The codon bias distribution for the entire set of more than 6,000
predicted yeast gene ORFs is presented in Fig. 4A. The interval with the
largest frequency of genes is between the codon bias values of 0.0 and 0.1.
This segment contains more than 2,500 genes. The distribution of the codon bias
values of the 128 different genes found in this study (all protein spots from
Fig. 2) is shown in Fig. 4B, and protein half-lives (predicted from applying
the N-end rule [33] to the experimentally determined or predicted protein N
termini) are shown in Fig. 4C. No genes were identified with codon bias values
less than 0.1 even though thousands of genes exist in this category. In
addition, nearly all of the proteins identified had long predicted half-lives
(greater than 30 h).

Correlation of mRNA and protein expression levels. The 
correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 
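
The ranking analysis behind Fig. 6 can be sketched as follows. The data below
are synthetic values invented to show the effect and are not taken from the
study; the function names are hypothetical. Genes are ranked by protein
abundance, and the Pearson coefficient is recomputed each time the next more
abundant gene is added.

```python
import math


def pearson(xs, ys):
    """Pearson product moment correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def cumulative_correlation(pairs, start):
    """Rank (mRNA, protein) pairs by protein level; correlate the lowest k."""
    ranked = sorted(pairs, key=lambda p: p[1])
    result = []
    for k in range(start, len(ranked) + 1):
        mrna = [m for m, _ in ranked[:k]]
        protein = [p for _, p in ranked[:k]]
        result.append(pearson(mrna, protein))
    return result


# Synthetic (mRNA, protein) pairs: eight low-abundance genes with weak
# correlation plus three highly abundant genes.
pairs = [(1, 5), (2, 3), (3, 8), (4, 2), (5, 9), (6, 4), (7, 7), (8, 6),
         (100, 300), (200, 700), (470, 860)]
trend = cumulative_correlation(pairs, start=5)
print([round(r, 2) for r in trend])
```

In this toy set the coefficient stays low while only low-abundance genes are
included and rises above 0.9 once the three abundant genes enter, the same
qualitative pattern the text describes for the 11 very highly expressed
proteins.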

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were highly similar. There was high variability in 
the data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

DISCUSSION 

FIG. 4. Current proteome analysis technology utilizing 2DE without
preenrichment samples mainly highly expressed and long-lived proteins. Genes
encoding highly expressed proteins generally have large codon bias values. (A)
Distribution of the yeast genome (more than 6,000 genes) based on codon bias.
The interval with the largest frequency of genes is 0.0 to 0.1, with more than
2,500 genes. (B) Distribution of the genes from identified proteins in this
study based on codon bias. No genes with codon bias values less than 0.1 were
detected in this study. (C) Distribution of identified proteins in this study
based on predicted half-life (estimated by N-end rule).

The desired end point for the description of a biological system is not the
analysis of mRNA transcript levels alone but also the accurate measurement of
protein expression levels and their respective activities. Quantitative
analysis of global mRNA levels currently is a preferred method for the analysis
of the state of cells and tissues (11). Several methods which either provide
absolute mRNA abundance (34, 35) or relative mRNA levels in comparative
analyses (20, 27) have been described elsewhere. The techniques are fast and
exquisitely sensitive and can provide mRNA abundance for potentially any
expressed gene. Measured mRNA levels are often implicitly or explicitly
extrapolated to indicate the levels of activity of the corresponding protein in
the cell. Quantitative analysis of protein expression levels (proteome
analysis) is much more time-consuming because proteins are analyzed
sequentially one by one and is not general because analyses are limited to the
relatively highly expressed proteins. Proteome analysis does, however, provide
types of data that are of critical importance for the description of the state
of a biological system and that are not readily apparent from the sequence and
the level of expression of the mRNA transcript. This study attempts to examine
the relationship between mRNA and protein expression levels for a large number
of expressed genes in cells representing the same state.

FIG. 5. Correlation between protein and mRNA levels for 106 genes in yeast
growing at log phase with glucose as a carbon source. mRNA and protein levels
were calculated as described in Materials and Methods. The data represent a
population of genes with protein expression levels visible by silver staining
on a 2D gel chosen to include the entire range of molecular weights,
isoelectric focusing points, and staining intensities. The inset shows the
low-end portion of the main figure. It contains 69% of the original data set.
The Pearson product moment correlation for the entire data set was 0.935. The
correlation for the inset containing 73 proteins (69%) was only 0.356.

Limits in the sensitivity of current protein analysis technology precluded a
completely random sampling of yeast proteins. We therefore based the study on
those proteins visible by silver staining on a 2D gel. Of the more than 1,000
visible spots, 156 were chosen to include the entire range of molecular
weights, isoelectric focusing points, and staining intensities displayed on the
2D protein pattern. The genes identified in this study shared a number of
properties. First, all of the proteins in this study had a codon bias of
greater than 0.1 and 93% were greater than 0.2 (Fig. 4B). Second, with few
exceptions, the proteins in this study had long predicted half-lives according
to the N-end rule (Fig. 4C). Third, low-abundance proteins with regulatory
functions such as transcription factors or protein kinases were not identified.

Because the population of proteins used in this study appears to be fairly
homogeneous with respect to predicted half-life and codon bias, it might be
expected that the correlation of the mRNA and protein expression levels would
be stronger for this population than for a random sample of yeast proteins. We
tested this assumption by evaluating the correlation value if different subsets
of the available data were included in the calculation. The 106 proteins were
ranked from lowest to highest protein expression level, and the trend in the
correlation value was evaluated by progressively including more of the
higher-abundance proteins in the calculation (Fig. 6). The correlation value
when only the lower-abundance 40 to 95 proteins were examined was consistently
between 0.1 and 0.4. If the 11 most abundant proteins were included, the
correlation steadily increased to 0.94. We therefore expect that the
correlation for all yeast proteins or for a random selection would be less than
0.4. The observed level of correlation between mRNA and protein expression
levels suggests the importance of posttranslational mechanisms controlling gene
expression. Such mechanisms include translational control (15) and control of
protein half-life (33). Since these mechanisms are also active in higher
eukaryotic cells, we speculate that there is no predictive correlation between
steady-state levels of mRNA and those of protein in mammalian cells.

Like other large-scale analyses, the present study has several potential
sources of error related to the methods used to determine mRNA and protein
expression levels. The mRNA levels were calculated from frequency tables of
SAGE data. This method is highly quantitative because it is based on actual
sequencing of unique tags from each gene, and the number of times that a tag is
represented is proportional to the number of mRNA molecules for a specific
gene. This method has some limitations including the following: (i) the
magnitude of the error in the measurement of mRNA levels is inversely
proportional to the mRNA levels, (ii) SAGE tags from highly similar genes may
not be distinguished and therefore are summed, (iii) some SAGE tags are from
sequences in the 3' untranslated region of the transcript, (iv) incomplete
cleavage at the SAGE tag site by the restriction enzyme can result in two tags
representing one mRNA, and (v) some transcripts actually do not generate a SAGE
tag (34, 35).

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per ceil. The 
conclusions drawn from tiiis study are dependent on the qual- 
ity of die mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
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FIG, 6. Effect of highly abundant proteins on Pearson product moment correlation coefficient for mRNA and protein abundance in yeast. The set of 106 genes was 
ranked according to protein abundance, and the correlation value was calculated by including the 40 lowest-abundance genes and then progressively including the 
remaining 66 genes in order of abundance. The correlation value climbs as the final 11 highly abundant proteins are included. 



than 4 copies/cell), the error associated with these values may 
be quite large. The mRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per cell would 
be detected 72% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 
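
The detection figure quoted above can be sanity-checked with a simple sampling model. The sketch below is an illustration only, not the calculation used in reference 35: it assumes each sequenced tag is an independent draw from the cellular mRNA pool, and the function name and parameters are hypothetical. With 20,000 tags and 15,000 mRNA molecules per cell it gives a detection probability of roughly 74% for a single-copy transcript, the same order as the 72% cited in the text; the exact value depends on the tag count actually used.

```python
def detection_probability(tags_sequenced: int,
                          mrnas_per_cell: int,
                          copies_per_cell: float = 1.0) -> float:
    """Probability that a transcript is seen at least once.

    Assumes (for illustration) that every sequenced tag is an
    independent random draw from the pool of mRNA molecules.
    """
    p_single_draw = copies_per_cell / mrnas_per_cell
    # Complement of "missed on every one of the draws".
    return 1.0 - (1.0 - p_single_draw) ** tags_sequenced


if __name__ == "__main__":
    # Figures taken from the text: >20,000 tags, 15,000 mRNAs per cell.
    for copies in (1, 4, 10):
        p = detection_probability(20_000, 15_000, copies)
        print(f"{copies:>2} copies/cell: detected with probability {p:.3f}")
```

The same model shows why the error grows as abundance falls: the expected tag count for a transcript scales linearly with its copy number, so low-copy transcripts are represented by only a handful of tags.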

Protein abundance was determined by metabolic radiolabel- 
ing with [35S]methionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based on the specificity of methionine aminopeptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 
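
The three-variable calculation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: the units (dpm, pmol), the cell-count parameter, and all names are hypothetical and chosen only to make the arithmetic concrete.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def protein_copies_per_cell(spot_dpm: float,
                            dpm_per_pmol_met: float,
                            n_methionines: int,
                            n_cells: float) -> float:
    """Estimate protein copies per cell from a radiolabeled gel spot.

    spot_dpm         -- radioactivity measured in the spot (hypothetical unit)
    dpm_per_pmol_met -- specific activity normalized per methionine
    n_methionines    -- methionines in the mature protein
    n_cells          -- cells represented in the loaded sample (assumed known)
    """
    pmol_protein = spot_dpm / (dpm_per_pmol_met * n_methionines)
    molecules = pmol_protein * 1e-12 * AVOGADRO
    return molecules / n_cells


def relative_error_from_met_miscount(true_mets: int, assumed_mets: int) -> float:
    """Fractional error in the estimate if the methionine count is wrong."""
    return true_mets / assumed_mets - 1.0


if __name__ == "__main__":
    # Miscounting one N-terminal methionine matters more for Met-poor proteins.
    for true_mets in (2, 7, 20):
        err = relative_error_from_met_miscount(true_mets, true_mets + 1)
        print(f"{true_mets} Met, counted as {true_mets + 1}: {err:+.1%}")
```

Because the estimate is inversely proportional to the methionine count, a one-residue miscount shifts the result by about 1/n, which is why the text expects the effect to be small at an average of 7.2 methionines per protein.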



The amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometric techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 μg) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
FIG. 7. Relationship between codon bias and protein and mRNA levels in this study. Yeast mRNA and protein expression levels were calculated as described in 
Materials and Methods. The data represent the same 106 genes as in Fig. 5. 



activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
ENO2_YEAST (775,000 and 289.1 copies/cell, respectively). 
ENO1_YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respec- 
tively). The codon bias values for ENO2 and ENO1 are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, ENO1_YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFT1, TIF1, HXK2, GSP1, EGD2, SHM2, and TAL1. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is expected that for the more 
complex cells of higher eukaryotic organisms the detection of 
low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. If pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low-abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
High-throughput technologies, such as proteomic screening and DNA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, MedGene, which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships [...]. Using MedGene, we analyzed a novel micro-array expression dataset 
comparing breast cancer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant correlation was [...] 
Introduction 

At its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g., 
breast cancer, a researcher would have had to scan 130 different 
journals and read 27 papers per day in 199[?]. This problem is 
accentuated with high-throughput technologies such as DNA 
micro-arrays and proteomics, which require the analysis of 
large datasets involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray 
experiment, thousands of genes may demonstrate statistically 
significant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if they could be compared to a comprehensive 
summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database and the 
Tumor Gene Database. However, as these resources are 
hand-curated, the labor-intensive review process becomes a 
rate-limiting step in the growth of the database. As a result, these 

* To whom correspondence should be addressed. E-mail: jlabaer@hms.harvard.edu. 
databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining: a method 
which involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context. This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
interactions that proteins or genes have with one another, in 
the literature or by functional annotation. Thus far, few 
publications have applied text-mining to examine the global 
[...] and diseases. [...] 
of genes and their predicted chromosomal locations in order 
to identify genes linked to inherited disorders. 

[...] understanding of disease development [...] 
all possible gene-disease relationships, including biochemical, 
[...] pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive 
comparisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
[...] predicted. Second, [...] 
which genes [...] the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 
comprehensive set of gene-disease relationships. In addition, 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and 
co-citation. We applied this tool to help interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedGene Database. MedGene is a relational database, storing 
disease and gene information from NCBI, text mining results, 
statistical scores, and hyperlinks to the primary literature. 
MedGene has a web-based user interface for users to 
query the database (http://hipseq.med.harvard.edu/MedGene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (National Library of Medicine) 
(http://www.nlm.nih.gov/mesh/meshhome.html) and human disease 
categories were selected. LocusLink files were downloaded from 
the LocusLink web site at NCBI (http://www.ncbi.nih.gov/ 
LocusLink/). Official/preferred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official/preferred names were used. A local copy of Medline 
records (up to July, 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the 
gene-related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table framework, we applied 
different statistical methods to estimate the strength of 
gene-disease relationships and evaluated the results. These methods 
included chi-square analysis, Fisher's exact probabilities, 
relative risk of gene, and relative risk of disease 
(http://[...]MedGene[...]). In addition, we computed 
the "product of frequency", which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysis with the micro-array data. Spearman 
[...] 
relationship between LPF and microarray fold change in 
expression level. 
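
The contingency-table counts and the LPF score described above can be sketched as follows. This is a minimal illustration, not the MedGene implementation: the record representation, function names, and sample data are hypothetical, and the denominators follow the wording of the text literally (records indexed for the disease only and for the gene only), which is an assumption about how "single hits" were counted.

```python
import math
from typing import Iterable, Optional, Set, Tuple

# A record is modeled as (gene keys found, disease terms indexed).
Record = Tuple[Set[str], Set[str]]


def contingency_counts(records: Iterable[Record],
                       gene: str,
                       disease: str) -> Tuple[int, int, int, int]:
    """Return (double positive, gene single, disease single, double negative)."""
    both = gene_only = disease_only = neither = 0
    for genes, diseases in records:
        has_gene, has_disease = gene in genes, disease in diseases
        if has_gene and has_disease:
            both += 1
        elif has_gene:
            gene_only += 1
        elif has_disease:
            disease_only += 1
        else:
            neither += 1
    return both, gene_only, disease_only, neither


def log_product_of_frequency(both: int,
                             gene_only: int,
                             disease_only: int) -> Optional[float]:
    """Natural log of (both/disease_only) * (both/gene_only).

    Returns None when the score is undefined (any count is zero).
    """
    if both == 0 or gene_only == 0 or disease_only == 0:
        return None
    return math.log((both / disease_only) * (both / gene_only))


if __name__ == "__main__":
    sample = [
        ({"ESR1"}, {"Breast Neoplasms"}),
        ({"ESR1"}, {"Breast Neoplasms"}),
        ({"ESR1"}, set()),
        ({"TP53"}, {"Breast Neoplasms"}),
        (set(), set()),
    ]
    counts = contingency_counts(sample, "ESR1", "Breast Neoplasms")
    print(counts, log_product_of_frequency(*counts[:3]))
```

The four counts are the same inputs that chi-square or Fisher's exact tests would use, so the other scores mentioned in the text could be computed from the same tuple.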

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the 'Cluster' software and the 
clustering result was visualized using 'TreeView' (http:// 
rana.lbl.gov/EisenSoftware.htm). 
Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (78% ER-positive) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of 
histological types, grades, and hormone receptor immuno-phenotypes 
of breast cancer. Biotinylated cRNA, generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Affymetrix U95A oligo-nucleotide micro-arrays. These 
micro-arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENECHIP 
software from Affymetrix, and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and a hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. Twenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information, Supplemental Table 1, or visit 
http://hipseq.med.harvard.edu/MedGene/publication/s_table1.html) were 
selected from the 2002 MeSH index, creating a list of 4033 
human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete 
list of genes, gene names, gene symbols, and frequently used 
[...] was collected from the LocusLink database at 
NCBI, which contains 53 259 independent records keyed 
by an official gene symbol or name (June 18th, 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at all. 
For the intended use of this study, this lack of distinction is 
unlikely to have a large effect and may in fact be beneficial. 
Initial attempts to search the literature using these lists 
[...] sources of false positives and false negatives 
(Table 1). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies, necessitating the development of filters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search terms. The 
false positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For purposes of data tracking, a primary gene key was 
[...] to represent all [...] that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract. Case-insensitive string mapping was used for all 
searches except as noted above. No additional weight was 



Analysis of Data Using Advanced Literature Mining 

research articles 

Table 1. Systematic Sources of False Positives and False Negatives in Unfiltered Data 

source of error                           type            example                                             filter solution 
gene symbol/name is not unique            false positive  MAG: myelin associated glycoprotein;                eliminate this term 
                                                          MAG: malignancy-associated protein 
gene symbol is unrelated abbreviation     false positive  PA: pallid homologue (mouse), pallidin              eliminate this term 
                                                          (also abbrev. for Pennsylvania) 
gene symbol/name has language meaning     false positive  WAS: Wiskott-Aldrich Syndrome                       case-sensitive string search 
                                                          (also the word "was") 
nonstandard syntax                        false negative  BAG-1 instead of BAG1                               add dash term 
unofficial gene name/symbol               false negative  p53 instead of TP53                                 add all gene nicknames 
nonspecified gene name                    false negative  estrogen receptor instead of Estrogen receptor 1    add family stem term 

False negatives are real relationships that are [...] and [...] error. In general error rate, [...] 

added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 
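
The indexing rules described above (any synonym triggers the primary gene key, case-insensitive matching by default, case-sensitive matching for ambiguous symbols, dash variants, and no extra weight for repeated hits) can be sketched as follows. This is an illustrative reconstruction, not the JAVA module used by the authors; the function names and the dash rule for symbols ending in digits are assumptions.

```python
import re
from typing import Dict, Iterable, Pattern, Set


def syntax_variants(term: str) -> Set[str]:
    """Add a dashed form for symbols like BAG1 -> BAG-1 (assumed rule)."""
    variants = {term}
    match = re.fullmatch(r"([A-Za-z]+)(\d+)", term)
    if match:
        variants.add(f"{match.group(1)}-{match.group(2)}")
    return variants


def build_matcher(terms: Iterable[str], case_sensitive: bool = False) -> Pattern[str]:
    """Compile one whole-token pattern covering every synonym of a gene key."""
    alternatives = sorted(
        (re.escape(v) for t in terms for v in syntax_variants(t)),
        key=len,
        reverse=True,  # try longer forms first, e.g. BAG-1 before BAG1
    )
    flags = 0 if case_sensitive else re.IGNORECASE
    body = "|".join(alternatives)
    # Lookarounds keep the term from matching inside a longer token.
    return re.compile(rf"(?<![A-Za-z0-9])(?:{body})(?![A-Za-z0-9])", flags)


def index_record(text: str, gene_index: Dict[str, Pattern[str]]) -> Set[str]:
    """Return primary gene keys found in a title/abstract.

    A set is returned, so repeated occurrences add no extra weight.
    """
    return {key for key, matcher in gene_index.items() if matcher.search(text)}


# Hypothetical two-gene index: WAS is matched case-sensitively so that the
# English word "was" does not trigger it.
GENE_INDEX = {
    "BAG1": build_matcher(["BAG1"]),
    "WAS": build_matcher(["WAS"], case_sensitive=True),
}

if __name__ == "__main__":
    print(index_record("BAG-1 was overexpressed in tumors.", GENE_INDEX))
    print(index_record("Mutations in WAS cause immunodeficiency.", GENE_INDEX))
```

Terms that remain ambiguous even under case-sensitive matching would simply be left out of the index, which corresponds to the "eliminate this term" filter in Table 1.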

Medline records were searched with all qualified gene 
identifiers, such as the official/preferred gene symbol, the 
official/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name, e.g., estrogen receptor instead of 
estrogen receptor 1 (ESR1), creating a source of false negatives. 
For this reason, gene family stem terms were created for all 
genes that have an alpha or numerical suffix (e.g., IL2RA, TGFB, 
ESR1, etc.) and then used to search the literature. The family 
stem terms were handled separately from the specific gene 
names so that it would be clear when linkages were made to 
the gene family versus a specific member in that family. 

To improve performance and accuracy, some pre-selection 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second, non-English journals were removed because 
the natural language filters were only relevant to English 
publications. Finally, journals unlikely to contain primary data 
about gene-disease relationships were also removed (e.g., Int 
J Health Educ, Bedside Nurse, and J Health Econ). Together, 
[...] Medline citations (July 2002). 

Ranking the Relative Strengths of Gene-Disease Associations. 
In total, there were 618 708 gene-disease co-citations, 
in which 16% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of 
gene-disease relationships, we tested several different statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LPF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGene. In developing this tool, it was 
important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were in conflict, inclusiveness was 
prioritized. To determine the false negative rate in MedGene, 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 



Figure 1. Estimation of the false negative rate by comparison 
[...]. The breast cancer-related genes 
identified by MedGene were compared with those listed in 
[...] Database (BCG), GeneCards 
(GC), and Swissprot. Genes were considered false negatives 
if they were represented in at least one of these other databases 
and not in MedGene and their link to breast cancer was 
supported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes in each database or shared by more than one 
database is indicated. The false negative rate was calculated as 
genes missed at MedGene (26)/total number of nonoverlapping 
genes in other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared the list of breast cancer-related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer-related genes that were 
supported by at least one literature citation in these 
hand-curated databases, 26 were absent from MedGene, suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for these genes (80 
papers) were reviewed manually (see the Supporting 
Information, Supplemental Table 2, or visit 
http://hipseq.med.harvard.edu/MedGene/publication/s_table2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity filters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or title (1.1%). Of note, 
MedGene identified approximately 2000 additional breast 
cancer-related genes not listed in any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information, Supplemental Table 3, or visit 
http://hipseq.med.harvard.edu/MedGene/publication/s_table3.html). 
Among the lowest ranked 200 genes, approximately 
70% reflected true relationships. Of the 600 records 
reviewed, there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer. 

The coincidence of some gene symbols with medical 
abbreviations, chemical abbreviations and biological abbreviations 
resulted in most of the false positives (see the Supporting 
Information, Supplemental Table 4, or visit 
http://hipseq.med.harvard.edu/MedGene/publication/s_table4.html), 
emphasizing the importance of the filters that were added in the 
search algorithm (Table 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESR1 and 10 to ESR2, whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESR1 or ESR2; 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the 
gene-disease relationships described by MedGene was performed. 
For this experiment, it was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedGene accurately reflected the literature, then an 
unsupervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical thinking. 
Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting information, 
Journal of Proteome Research • Vol. 2, No. [...] 
Supplemental Figure 1, or visit 
http://hipseq.med.harvard.edu/MedGene/publication/s_figure1.html). For example, in one 
such cluster shown in Figure 2, diabetes and its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by the false 
negative rate and down by the false positive rate ([...]% 
on average). Using this, the average disease has 103.7 ± 45.3 
(mean ± s.d.) genes associated with it, although the range is 
quite broad, with 2359 genes related to breast cancer, 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes that had greater than a 3-fold difference in mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene, we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer in the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search 
(first-degree association by family term); 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2, or visit http://hipseq.med.harvard.edu/MedGene/ 
publication/s_figure2.html.) Among the 505 previously 
unrelated genes, 467 were either newly identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes, [...] had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the 
micro-array analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer, ranging from 
less than 1-fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater than 10-fold expression 
changes were linked to breast cancer by first-degree 
association. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and 
micro-array fold change (r = 0.018, P value = 0.6[?]). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B), suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had 10-fold change or more in 
expression level, the correlation increased to 0.41 (P value [...]). 
When we evaluated the microarray data separately for ER
positive and ER negative tumors, the trend in correlation
between fold change and literature score was highly dependent
on estrogen receptor status. Interestingly, there was a similar
trend in correlation for ER positive tumors, but no trend in
correlation for ER negative tumors.
[Figure legend fragment, largely illegible in the source. Recoverable
content: a sample of the data is shown; the listed disease terms
include burns, diabetic nephropathies, starvation states, diabetic
small vessel complications, and nutritional disorders (Supplemental
Figure 1).]

Finally, to validate our findings, we computed similar
correlations between the breast cancer expression data and
LPF scores generated by MedGene for hypertension, a
disease unrelated to breast cancer. As expected, we did not
observe an increasing trend in correlation for hypertension.
Figure 3. Relationship between literature score and functional data
for breast cancer. [Legend partly illegible in the source.] Samples
of breast tumors and normal breast tissue were analyzed in an
expression analysis, and the fold change of expression level between
breast tumor and normal samples was plotted against the literature
score for the same gene set. Green dots represent first-degree
association by gene search; other dots represent the remaining
classes, including no association. A shaded area marks a region of
expression changes. A second panel shows the correlation between the
literature score (LPF) and the fold change of expression level
between tumor and normal breast tissue, plotted against the fold
change of expression level (x-axis). Gene rank lists were generated
for breast cancer and for hypertension. Correlations were also
computed between the breast cancer gene LPF scores and fold change
expression levels for estrogen receptor positive tumors only (light
blue) and estrogen receptor negative tumors only (purple).
Discussion

The Human Genome Project heralded a new era in biological
research where the emphasis on understanding specific pathways
has expanded to global studies of genomic organization
and biological systems. High-throughput technologies can
provide novel insight into comprehensive biological function
but also introduce new challenges. The utility of these
technologies is limited by the ability to generate, analyze, and
interpret large gene lists. MedGene, a relational database
derived by mining the information in Medline, was created to
address this need. MedGene users can query for a rank-ordered
list of human gene-disease relationships (Table 2) for one or
more diseases. Each entry is hyperlinked to the original papers
supporting each association and to other relevant databases.

MedGene is an innovative extension of previous text mining
approaches. Perez-Iratxeta et al. used the GO annotation and
their chromosomal locations to predict genes that may contribute
to inherited disorders.1 MedGene takes a broader view
and includes all diseases and all possible gene-disease
relationships. Furthermore, MedGene utilizes co-citation to indicate a
relationship rather than GO annotation, which is limited to the
subset of genes that have GO annotation. Our approach is
complementary to that taken by Chaussabel and Sher, who
used the frequency of co-cited terms to cluster genes into a
hierarchy of gene-gene relationships.5

A unique aspect of this tool is the ability to assess the relative
strengths of gene-disease relationships based on the frequency
of both co-citation and single citation. This presupposes that
most co-citations describe a positive association, often referred
to as "publication bias," and is supported by our observations
that negative associations are rare (Supplemental Table 3;
http://hlpseq.med.harvard.edU/MedGene/publlcatlon/s Table 3.htm|).
Of course, relationships established by frequency
of co-citation do not necessarily represent a true biological link;
however, it is strong evidence to support a true relationship.
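
To make the idea of scoring an association from co-citation and single-citation counts concrete, here is a minimal sketch. The formula (natural log of the product of two proportions) is an illustrative assumption chosen for this example; this excerpt does not define how MedGene computes its literature score, and the gene names and counts below are hypothetical.

```python
import math


def cocitation_score(double_hits, disease_hits, gene_hits):
    """Illustrative association score from citation counts (assumed formula).

    double_hits:  papers citing both the gene and the disease
    disease_hits: papers citing the disease
    gene_hits:    papers citing the gene
    """
    if not 0 <= double_hits <= min(disease_hits, gene_hits):
        raise ValueError("co-citations cannot exceed either single count")
    if double_hits == 0:
        return float("-inf")  # never co-cited: weakest possible score
    # Product of the two proportions, on a log scale.
    return math.log((double_hits / disease_hits) * (double_hits / gene_hits))


def rank_genes(cocitations, gene_totals, disease_hits):
    """Return (gene, score) pairs sorted from strongest to weakest."""
    scored = [
        (gene, cocitation_score(n, disease_hits, gene_totals[gene]))
        for gene, n in cocitations.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical counts for a disease cited in 1000 papers.
ranking = rank_genes(
    cocitations={"GENE_A": 200, "GENE_B": 5, "GENE_C": 0},
    gene_totals={"GENE_A": 400, "GENE_B": 500, "GENE_C": 50},
    disease_hits=1000,
)
for gene, score in ranking:
    print(gene, score)
```

Using both single-citation counts as denominators keeps a heavily studied gene from ranking high merely because it appears in many papers overall.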
Another important feature of MedGene is the implementation
of software filters that substantially reduced the error rate.
We estimate that less than 10% of all associations were missed
and at least 70% of even the weakest associations were real.
For this study, all of the filters that we applied were general
ones, e.g., expanding the list of all gene names to address the
different syntax forms used by different journals, eliminating
gene names that correspond to common English words, etc.
The majority of the remaining search term ambiguities were
idiosyncratic and difficult to identify systematically without
causing a significant rise in false negatives. Alternative
approaches, such as the examination of the nearest neighbor
[...], could be considered to further reduce the false positives.

It is not uncommon to see expression changes in microarray
experiments as small as 2-fold reported in the literature.
Even when these expression changes are statistically significant,
it is not always clear if they are biologically meaningful. When
comparing expression levels of disease to normal tissue, one
expects an enrichment of known disease-related genes to
appear in the altered expression group. MedGene provided a
unique opportunity to test this notion in the context of existing
knowledge on a novel breast cancer microarray dataset. For
genes displaying a 5-fold change or less in tumors compared
to normal, there was no evidence of a correlation between
altered gene expression and a known role in the disease. This
Table 3. Genes with Large Expression Changes in ER- but
Not in ER+ Breast Tumors

[Table body flattened in extraction; most gene names are lost and
rows cannot be re-paired. Legible gene names: ZIC1, TLR1, CDKN$, LRP8.]

First numeric column, as extracted: 1.0, 1.2, 1.2, 1.9, 1.0, 2.6,
1.0, 4.0, 3.8, 47, 1.0, 1.6, 4.2, 4.4, -1.2, 2.9, 1.0, 4.0, -4.3,
2.9, 3.0, 4.6, 1.0, -1.3, -1.1, 1.3, -4J, 1.1, -4.6, -1.1, -2.8,
-1.4, -1.0, -1.6, 2.4

Second numeric column, as extracted: 610.8, mA, 65.8, 59.6, 38.5,
33.2, 30.6, 27.9, 21.9, 18.6, 14.6, 14.4, 13.5, 13.0, 12.9, 12.3,
12.2, 11.8, 11.6, 11.1, 10.9, 10.2, 10.2, 10.0, -10.4, -10.8, -11.4,
-15 J, -16.2, -22.3, -36.4, -51.5, -64.9, -83.1, -85.9, -150.3

Table 3. MedGene identified a set of relatively understudied, yet highly
expressed genes in ER negative, but not ER positive breast tumors. All of
these genes have either never been co-cited with breast cancer or have a
weak association except those marked with an asterisk (*).

reflects the many genes whose role in breast cancer may not
involve large changes in expression in sporadic tumors (e.g.,
BRCA1 and BRCA2) and genes whose modest changes in
expression may be unrelated to the disease. Strikingly, among
genes with a 10-fold change or more in expression level there
was a strong and significant correlation between expression
level and a published role in the disease, providing the first
global validation of the microarray approach to identifying
disease-specific genes.

The results derived from MedGene have two implications.
First, a careful hunt for corroborating evidence of a role in
breast cancer should precede any further study of genes with
less than 5-fold expression level changes. Second, any genes
with 10-fold changes or more are likely to be related to breast
cancer and warrant attention. It is likely that this threshold will
change depending on the disease as well as the experiment.

Interestingly, the observed correlation was only found among
ER-positive tumors, not ER-negative. This may reflect a bias
in the literature to study the more prevalent type of tumor in
the population. Furthermore, this emphasizes that caution
must be taken when interpreting experiments that may contain
subpopulations that behave very differently. The MedGene
approach identified a set of relatively understudied, yet highly
expressed genes in ER-negative tumors that are worthy of
further examination (Table 3).
In conclusion, we have developed an automated method of
summarizing and organizing the vast biomedical literature. To
our knowledge, the resulting database is the most comprehensive
and accurate of its kind. By generating a score that reflects
the strength of the association, it provides an important tool
for the rapid and flexible analysis of large datasets from various
high-throughput screening experiments. Furthermore, it can
be used for selecting subsets of genes for functional studies,
for building disease-specific arrays, for looking at genes common
to multiple diseases, and various other high-throughput
applications. In the future, it will be possible to enhance the
utility of the MedGene database by building links between
genes and other MeSH terms as well as other biological
processes and concepts, such as cell division and responses to
small molecules.
SECOND DECLARATION OF PAUL POLAKIS, Ph.D.
I, Paul Polakis, Ph.D., declare and say as follows:

1. I am currently employed by Genentech, Inc. where my job title is Staff
Scientist.

2. Since joining Genentech in 1999, one of my primary responsibilities has
been leading Genentech's Tumor Antigen Project, which is a large research
project with a primary focus on identifying tumor cell markers that find use
as targets for both the diagnosis and treatment of cancer in humans.

3. As I stated in my previous Declaration dated May 7, 2004 (attached as
Exhibit A), my laboratory has been employing a variety of techniques,
including microarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal tissue(s). 

4. In the course of our research using microarray analysis, we have identified
approximately 200 gene transcripts that are present in human tumor tissue
at significantly higher levels than in normal human tissue. To date, we
have successfully generated antibodies that bind to 31 of the tumor antigen
proteins expressed from these differentially expressed gene transcripts and
have used these antibodies to quantitatively determine the level of
production of these tumor antigen proteins in both human tumor tissue and
normal tissue. We have then quantitatively compared the levels of mRNA
and protein in both the tumor and normal tissues analyzed. The results of
these analyses are attached herewith as Exhibit B. In Exhibit B, "+" means
that the mRNA or protein was detectably overexpressed in the tumor tissue
relative to normal tissue and "-" means that no detectable overexpression
was observed in the tumor tissue relative to normal tissue.

5. As shown in Exhibit B, of the 31 genes identified as being detectably
overexpressed in human tumor tissue as compared to normal human tissue
at the mRNA level, 28 of them (i.e., greater than 90%) are also detectably
overexpressed in human tumor tissue as compared to normal human tissue
at the protein level. As such, in the cases where we have been able to
quantitatively measure both (i) mRNA and (ii) protein levels in both (i)
tumor tissue and (ii) normal tissue, we have observed that in the vast
majority of cases, there is a very strong correlation between increases in
mRNA expression and increases in the level of protein encoded by that
mRNA.



6. Based upon my own experience accumulated in more than 20 years of
research, including the data discussed in paragraphs 4-5 above and my
knowledge of the relevant scientific literature, it is my considered scientific
opinion that for human genes, an increased level of mRNA in a tumor
tissue relative to a normal tissue more often than not correlates to a similar
increase in abundance of the encoded protein in the tumor tissue relative to
the normal tissue. In fact, it remains a generally accepted working
assumption in molecular biology that increased mRNA levels are more
often than not predictive of elevated levels of the encoded protein. In fact,
an entire industry focusing on the research and development of therapeutic
antibodies to treat a variety of human diseases, such as cancer, operates on
this working assumption.

7. I hereby declare that all statements made herein of my own knowledge are
true and that all statements made on information or belief are believed to be
true, and further that these statements were made with the knowledge that
willful false statements and the like so made are punishable by fine or
imprisonment, or both, under Section 1001 of Title 18 of the United States
Code and that such willful statements may jeopardize the validity of the
application or any patent issued thereon.
DECLARATION OF PAUL POLAKIS, Ph.D.
I, Paul Polakis, Ph.D., declare and say as follows: 

1. I was awarded a Ph.D. by the Department of Biochemistry of the Michigan
State University in 1984. My scientific Curriculum Vitae is attached to and forms
part of this Declaration (Exhibit A).

2. I am currently employed by Genentech, Inc. where my job title is Staff
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has
been leading Genentech's Tumor Antigen Project, which is a large research project
with a primary focus on identifying tumor cell markers that find use as targets for
both the diagnosis and treatment of cancer in humans.

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed
proteins "tumor antigen proteins." When such a tumor antigen protein is
identified, one can produce an antibody that recognizes and binds to that protein.
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen
Project, we have employed a variety of scientific techniques for detecting and
studying differential gene expression in human tumor cells relative to normal cells,
at genomic DNA, mRNA and protein levels. An important example of one such
technique is the well known and widely used technique of microarray analysis
which has proven to be extremely useful for the identification of mRNA molecules
that are differentially expressed in one tissue or cell type relative to another. In the
course of our research using microarray analysis, we have identified
approximately 200 gene transcripts that are present in human tumor cells at
significantly higher levels than in corresponding normal human cells. To date, we
have generated antibodies that bind to about 30 of the tumor antigen proteins
expressed from these differentially expressed gene transcripts and have used these
antibodies to quantitatively determine the level of production of these tumor
antigen proteins in both human cancer cells and corresponding normal cells. We
have then compared the levels of mRNA and protein in both the tumor and normal
cells analyzed.

5. From the mRNA and protein expression analyses described in paragraph 4
above, we have observed that there is a strong correlation between changes in the
level of mRNA present in any particular cell type and the level of protein
expressed from that mRNA in that cell type. In approximately 80% of our
observations we have found that increases in the level of a particular mRNA
correlates with changes in the level of protein expressed from that mRNA when
human tumor cells are compared with their corresponding normal cells.

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon.
extracts. If these minor cell proteins differ among cells to the same extent as the
more abundant proteins, as is commonly assumed, only a small number of protein
differences (perhaps several hundred) suffice to create very large differences
in cell morphology and behavior.

A Cell Can Change the Expression of Its Genes
in Response to External Signals 3

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization — different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently
distinctive character. These features reflect the persistent expression of different sets of
genes. 

Gene Expression Can Be Regulated at Many of the Steps
in the Pathway from DNA to RNA to Protein 4

If differences between the various cell types of an organism depend on the
particular genes that the cells express, at what level is the control of gene expression
exercised? There are many steps in the pathway leading from DNA to protein, and
all of them can in principle be regulated. Thus a cell can control the proteins it
makes by (1) controlling when and how often a given gene is transcribed
(transcriptional control), (2) controlling how the primary RNA transcript is spliced or
otherwise processed (RNA processing control), (3) selecting which completed
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport
control), (4) selecting which mRNAs in the cytoplasm are translated by ribosomes
(translational control), (5) selectively destabilizing certain mRNA molecules in
the cytoplasm (mRNA degradation control), or (6) selectively activating,
inactivating, or compartmentalizing specific protein molecules after they have been
made (protein activity control) (Figure 9-2).

For most genes transcriptional controls are paramount. This makes sense
because, of all the possible control points illustrated in Figure 9-2, only
transcriptional control ensures that no superfluous intermediates are synthesized. In the
Summary

The many types of cells in animals and plants are created largely through
mechanisms that cause different genes to be transcribed in different cells. Since many
specialized animal cells can maintain their unique character when grown in culture, the
gene regulatory mechanisms involved in creating them must be stable once
established, and heritable when the cell divides, endowing the cell with a memory of its
developmental history. Procaryotes and yeasts provide unusually accessible model
systems in which to study gene regulatory mechanisms, some of which may be
relevant to the creation of specialized cell types in higher eucaryotes. One such
mechanism involves a competitive interaction between two (or more) gene regulatory
proteins, each of which inhibits the synthesis of the other; this can create a flip-flop
switch that switches a cell between two alternative patterns of gene expression.
Direct or indirect positive feedback loops, which enable gene regulatory proteins to
perpetuate their own synthesis, provide a general mechanism for cell memory.

In eucaryotes gene transcription is generally controlled by combinations of gene
regulatory proteins. It is thought that each type of cell in a higher eucaryotic organism
contains a specific combination of gene regulatory proteins that ensures the
expression of only those genes appropriate to that type of cell. A given gene regulatory
protein may be expressed in a variety of circumstances and typically is involved in the
regulation of many genes.

In addition to diffusible gene regulatory proteins, inherited states of chromatin
condensation are also utilized by eucaryotic cells to regulate gene expression. In
vertebrates DNA methylation also plays a part, mainly as a device to reinforce decisions
about gene expression that are made initially by other mechanisms.



Although controls on the initiation of gene transcription are the predominant
form of regulation for most genes, other controls can act later in the pathway
from RNA to protein to modulate the amount of gene product that is made.
Although these posttranscriptional controls, which operate after RNA polymerase
has bound to the gene's promoter and begun RNA synthesis, are less common
than transcriptional control, for many genes they are crucial. It seems that every
step in gene expression that could be controlled in principle is likely to be
regulated under some circumstances for some genes.

We consider the varieties of posttranscriptional regulation in temporal
order according to the sequence of events that might be experienced by an RNA
molecule after its transcription has begun (Figure 9-71).
FROM DNA TO RNA

Transcription and translation are the means by which cells read out, or express,
the genetic instructions in their genes. Because many identical RNA copies can
be made from the same gene, and each RNA molecule can direct the synthesis
of many identical protein molecules, cells can synthesize a large amount of
protein rapidly when necessary. But each gene can also be transcribed and
translated with a different efficiency, allowing the cell to make vast quantities of
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in
the next chapter, a cell can change (or regulate) the expression of each of its
genes according to the needs of the moment, most obviously by controlling
the production of its RNA.



Figure 6-3 Genes can be expressed
with different efficiencies. Gene A is
transcribed and translated much more
efficiently than gene B. This allows the
amount of protein A in the cell to be
much greater than that of protein B.
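
As a rough illustration of how different transcription and translation efficiencies compound, the sketch below uses a simplified two-step steady-state model. The model and every rate constant are assumptions introduced here for illustration; neither appears in the excerpt.

```python
def steady_state_levels(k_transcription, k_mrna_decay,
                        k_translation, k_protein_decay):
    """Steady-state mRNA and protein amounts in a two-step linear model.

    Assumed model (not from the source text):
        d(mRNA)/dt    = k_transcription - k_mrna_decay * mRNA
        d(protein)/dt = k_translation * mRNA - k_protein_decay * protein
    Setting both derivatives to zero gives the expressions below.
    """
    mrna = k_transcription / k_mrna_decay
    protein = k_translation * mrna / k_protein_decay
    return mrna, protein


# Hypothetical rate constants: gene A is transcribed and translated
# ten times more efficiently than gene B; decay rates are equal.
mrna_a, protein_a = steady_state_levels(10.0, 1.0, 100.0, 0.1)
mrna_b, protein_b = steady_state_levels(1.0, 1.0, 10.0, 0.1)

print("mRNA ratio A/B:", mrna_a / mrna_b)
print("protein ratio A/B:", protein_a / protein_b)
```

In this toy model the two efficiencies multiply, so a tenfold difference at each step produces a hundredfold difference in protein.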






Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions
is to copy a particular portion of its DNA nucleotide sequence (a gene) into an
RNA nucleotide sequence. The information in RNA, although copied into another
chemical form, is still written in essentially the same language as it is in DNA:
the language of a nucleotide sequence. Hence the name transcription.

Like DNA, RNA is a linear polymer made of four different types of nucleotide
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from
DNA chemically in two respects: (1) the nucleotides in RNA are
ribonucleotides, that is, they contain the sugar ribose (hence the name
ribonucleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U)
instead of the thymine (T) in DNA. Since U, like T, can base-pair by
hydrogen-bonding with A (Figure 6-5), the complementary base-pairing properties
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with
C and A pairs with U). It is not uncommon, however, to find other types of base
pairs in RNA: for example, G pairing with U occasionally.
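
The pairing rules just described can be written out as a small sketch. The function names and the example sequence are hypothetical; the template strand is assumed to be written 3' to 5' so that the RNA comes out in the usual 5' to 3' order.

```python
# Base each DNA template nucleotide specifies in the RNA copy.
DNA_TO_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}

# Standard RNA pairs, plus the occasional G-U pair noted in the text.
STANDARD_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
OCCASIONAL_PAIRS = {("G", "U"), ("U", "G")}


def transcribe(template_3_to_5):
    """Return the RNA complementary to a DNA template strand.

    The template is given 3'->5'; the returned RNA reads 5'->3'.
    """
    try:
        return "".join(DNA_TO_RNA[base] for base in template_3_to_5.upper())
    except KeyError as exc:
        raise ValueError(f"not a DNA base: {exc.args[0]!r}") from None


def rna_bases_can_pair(a, b, allow_occasional=True):
    """True if two RNA bases can pair under the rules above."""
    pair = (a.upper(), b.upper())
    if pair in STANDARD_PAIRS:
        return True
    return allow_occasional and pair in OCCASIONAL_PAIRS


print(transcribe("TACGGT"))
print(rna_bases_can_pair("G", "U"))
```

Keeping the G-U pair behind a flag separates the standard complementarity rules from the less common pairings.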

Despite these small chemical differences, DNA and RNA differ quite
dramatically in overall structure. Whereas DNA always occurs in cells as a
double-stranded helix, RNA is single-stranded. RNA chains therefore fold up into a
variety of shapes, just as a polypeptide chain folds up to form the final shape of
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into
complex three-dimensional shapes allows some RNA molecules to have structural
and catalytic functions.
Transcription Produces RNA Complementary to
One Strand of DNA

All of the RNA in a cell is made by DNA transcription, a process that has
certain similarities to the process of DNA replication discussed in Chapter 5.
Figure 6-90 The production of a
protein by a eucaryotic cell. The final
level of each protein in a eucaryotic cell
depends upon the efficiency of each step
depicted.


(Figure 6-90) could be regulated by the cell for each individual protein. However, as
we shall see in Chapter 7, the initiation of transcription is the most common
point for a cell to regulate the expression of each of its genes. This makes sense,
inasmuch as the most efficient way to keep a gene from being expressed is to
block the very first step: the transcription of its DNA sequence into an RNA
molecule.



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The
amino acids used for protein synthesis are first attached to a family of tRNA
molecules, each of which recognizes, by complementary base-pair interactions,
particular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in
the mRNA is then read from one end to the other in sets of three according to the
genetic code.
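
Reading an mRNA in sets of three can be sketched as follows. The codon table is deliberately partial (a handful of entries from the standard genetic code) and the example sequence is hypothetical; a real translator would carry all 64 codons.

```python
# Partial codon table (standard genetic code); illustration only.
CODON_TABLE = {
    "AUG": "Met", "UUU": "Phe", "UUC": "Phe", "UGG": "Trp",
    "AAA": "Lys", "AAG": "Lys",
    "GGU": "Gly", "GGC": "Gly", "GGA": "Gly", "GGG": "Gly",
    "GCU": "Ala", "GCC": "Ala", "GCA": "Ala", "GCG": "Ala",
}
STOP_CODONS = {"UAA", "UAG", "UGA"}


def translate(mrna):
    """Translate from the first AUG until a stop codon or the end.

    Returns a list of three-letter amino acid names. Raises ValueError
    if there is no AUG or a codon is missing from the partial table.
    """
    mrna = mrna.upper()
    start = mrna.find("AUG")
    if start == -1:
        raise ValueError("no start codon (AUG) found")
    peptide = []
    # Step through the message three nucleotides at a time.
    for i in range(start, len(mrna) - 2, 3):
        codon = mrna[i:i + 3]
        if codon in STOP_CODONS:
            break
        if codon not in CODON_TABLE:
            raise ValueError(f"codon {codon} not in the partial table")
        peptide.append(CODON_TABLE[codon])
    return peptide


print("-".join(translate("GGAUGUUUGGCAAAUGGUAAGG")))
```

Starting at the first AUG fixes the reading frame, which is why the two nucleotides before it in the example are ignored.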

To initiate translation, a small ribosomal subunit binds to the mRNA molecule
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A
large ribosomal subunit binds to complete the ribosome and begin the elongation
phase of protein synthesis. During this phase, aminoacyl tRNAs, each bearing a
specific amino acid, bind sequentially to the appropriate codon in mRNA by forming
complementary base pairs with the tRNA anticodon. Each amino acid is added to the
C-terminal end of the growing polypeptide by means of a cycle of three sequential
Figure 7-5 Six steps at which
eucaryotic gene expression can be
controlled. Controls that operate at
steps 1 through 5 are discussed in this
chapter. Step 6, the regulation of protein
activity, includes reversible activation or
inactivation by protein phosphorylation
(discussed in Chapter 3) as well as
irreversible inactivation by proteolytic
degradation (discussed in Chapter 6).



Gene Expression Can Be Regulated at Many of the Steps
in the Pathway from DNA to RNA to Protein

If differences among the various cell types of an organism depend on the
particular genes that the cells express, at what level is the control of gene expression
exercised? As we saw in the last chapter, there are many steps in the pathway
leading from DNA to protein, and all of them can in principle be regulated. Thus
a cell can control the proteins it makes by (1) controlling when and how often a
given gene is transcribed (transcriptional control), (2) controlling how the RNA
transcript is spliced or otherwise processed (RNA processing control), (3)
selecting which completed mRNAs in the cell nucleus are exported to the cytosol
and determining where in the cytosol they are localized (RNA transport and
localization control), (4) selecting which mRNAs in the cytoplasm are translated
by ribosomes (translational control), (5) selectively destabilizing certain mRNA
molecules in the cytoplasm (mRNA degradation control), or (6) selectively
activating, inactivating, degrading, or compartmentalizing specific protein
molecules after they have been made (protein activity control) (Figure 7-5).

For most genes transcriptional controls are paramount. This makes sense
because, of all the possible control points illustrated in Figure 7-5, only
transcriptional control ensures that the cell will not synthesize superfluous
intermediates. In the following sections we discuss the DNA and protein components
that perform this function by regulating the initiation of gene transcription. We
shall return at the end of the chapter to the additional ways of regulating gene
expression.



Summary

The genome of a cell contains in its DNA sequence the information to make many
thousands of different protein and RNA molecules. A cell typically expresses only a
fraction of its genes, and the different types of cells in multicellular organisms arise
because different sets of genes are expressed. Moreover, cells can change the pattern
of genes they express in response to changes in their environment, such as signals
from other cells. Although all of the steps involved in expressing a gene can in
principle be regulated, for most genes the initiation of RNA transcription is the most
important point of control.



How does a cell determine which of its thousands of genes to transcribe? As
mentioned briefly in Chapters 4 and 6, the transcription of each gene is
controlled by a regulatory region of DNA relatively near the site where transcription
begins. Some regulatory regions are simple and act as switches that are thrown
by a single signal. Many others are complex and act as tiny microprocessors,
responding to a variety of signals that they interpret and integrate to switch the
neighboring gene on or off. Whether complex or simple, these switching devices
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In tins study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis 
and gathered quantitative information from about 1,400 spots. We found that there is an 
enormous range of protein abundance and, for identified spots, a good correlation 
between protein abundance, mRNA abundance, and codon bias. For each molecule of 
well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover wa 
examined and found to be insignificant for abundant proteins. Some phosphoproteins 
were identified. The behavior of proteins in differential centrifugation experiments was 
examined. Such experiments with 2D gels can give a global view of the yeast proteome. 



The sequence of the yeast genome has been determined (9).
More recently, the number of mRNA molecules for each expressed
gene has been measured (27, 30). The next logical level
of analysis is that of the expressed set of proteins. We have
begun to analyze the yeast proteome by using two-dimensional
(2D) gels.

2D gel electrophoresis separates proteins according to iso- 
electric point in one dimension and molecular weight in the 
other dimension (21), allowing resolution of thousands of pro- 
teins on a single gel. Although modern imaging and computing 
techniques can extract quantitative data for each of the spots in 
a 2D gel, there are only a few cases in which quantitative data 
have been gathered from 2D gels. 2D gel electrophoresis is 
almost unique in its ability to examine biological responses 
over thousands of proteins simultaneously and should there- 
fore allow us a relatively comprehensive view of cellular me- 
tabolism. 

We and others have worked toward assembling a yeast protein
database consisting of a collection of identified spots in 2D
gels and of data on each of these spots under various conditions
(2, 7, 8, 10, 23, 25). These data could then be used in
analyzing a protein or a metabolic process. Saccharomyces
cerevisiae is a good organism for this approach since it has a
well-understood physiology as well as a large number of mutants,
and its genome has been sequenced. Given the sequence
and the relative lack of introns in S. cerevisiae, it is easy to
predict the sequence of the primary protein product of most
genes. This aids tremendously in identifying these proteins on
2D gels.

There are three pillars on which such a database rests: (i)
visualization of many protein spots simultaneously, (ii) quantification
of the protein in each spot, and (iii) identification of
the gene product for each spot. Our first efforts at visualization
and identification for S. cerevisiae have been described elsewhere
(7, 8). Here we describe quantitative data for these
proteins under a variety of experimental conditions.

MATERIALS AND METHODS 
Strains and media. S. cerevisiae W303 (MATa ade2-1 his3-11,15 leu2-3,112
trp1-1 ura3-1 can1-100) was used (26). -Met YNB (yeast nitrogen base) medium
was 1.7 g of YNB (Difco) per liter, 5 g of ammonium sulfate per liter, and
adenine, uracil, and all amino acids except methionine; -Met -Cys YNB medium
was the same but without methionine or cysteine. Medium was supplemented
with 2% glucose (for most experiments) or with 2% ethanol (for ethanol
experiments). Low-phosphate YEPD was described by Warner (23).

Isotopic labeling of yeast and preparation of cell extracts. Yeast strains were
labeled and proteins were extracted as described by Garrels et al. (7, 8). Briefly,
cells were grown to 5 × 10^6 cells per ml at 30°C; 1 ml of culture was transferred
to a fresh tube, and 0.3 mCi of [35S]methionine (e.g., Express protein labeling
mix; New England Nuclear) was added to this 1-ml culture. The cells were
incubated for a further 10 to 15 min and then transferred to a 1.5-ml
microcentrifuge tube, chilled on ice, and harvested by centrifugation. The supernatant was
removed, and the cell pellet was resuspended in 100 µl of lysis buffer (20 mM
Tris-HCl [pH 7.6], 10 mM NaF, 10 mM sodium pyrophosphate, 0.5 mM EDTA,
0.1% deoxycholate; just before use, phenylmethylsulfonyl fluoride was added to
1 mM, leupeptin was added to 1 µg/ml, pepstatin was added to 1 µg/ml,
tosylsulfonyl phenylalanyl chloromethyl ketone was added to 10 µg/ml, and soybean
trypsin inhibitor was added to 10 µg/ml).

The resuspended cells were transferred to a screw-cap 1.5-ml polypropylene
tube containing 0.28 g of glass beads (0.5-mm diameter; Biospec Products) or
0.40 g of zirconia beads (0.5-mm diameter; Biospec Products). After the cap was
secured, the tube was inserted into a MiniBeadbeater 8 (Biospec Products) and
shaken at medium high speed at 4°C for 1 min. Breakage was typically 75%.
Tubes were then spun in a microcentrifuge for 10 s at 5,000 × g at 4°C.

With a very fine pipette tip, liquid was withdrawn from the beads and transferred
to a prechilled 1.5-ml tube containing 7 µl of DNase I (0.3 mg/ml; Cooper
product no. 6330)-RNase A (0.25 mg/ml; Cooper product no. 5679)-Mg (50 mM
MgCl2) mix. Typically 70 µl of liquid was recovered. The mixture was incubated
on ice for 10 min to allow the RNase and DNase to work.

Next, 75 µl of 2× dSDS (2× dSDS is 0.6% sodium dodecyl sulfate [SDS], 2%
mercaptoethanol, and 0.1 M Tris-HCl [pH 8]) was added. The tube was plunged
into boiling water, incubated for 1 min, and then plunged into ice. After cooling,
the tube was centrifuged at 4°C for 3 min at 14,000 × g. The supernatant was
transferred to a fresh tube and frozen at -70°C. About 5 µl of this supernatant
was used for each 2D gel.

2D polyacrylamide gels. 2D gels were made and run as described elsewhere
(6-8).

Image analysis of the gels. The Quest II software system was used for quantitative
image analysis (20, 22). Two techniques were used to collect quantitative
data for analysis by Quest II software. First, before the advent of phosphorimagers,
gels were dried and fluorographed. Each gel was exposed to film for three
different times (typically 1 day, 2 weeks, and 6 weeks) to increase the dynamic
range of the data. The films were scanned along with calibration strips to relate
film optical density to disintegrations per minute in the gels and analyzed by the
software to obtain a linear relationship between disintegrations per minute in the
spots and optical densities of the film images. The quantitative data are expressed
as parts per million of the total cellular protein. This value is calculated
from the disintegrations per minute of the sample loaded onto the gel and by
comparing the film density of each data spot with density of the film over the
calibration strips of known radioactivity exposed to the same film. This yields the
disintegrations per minute per millimeter for each spot on the gel and thence its
parts-per-million value.
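The film calibration described above can be illustrated with a minimal sketch. It assumes a simple least-squares line relating optical density to disintegrations per minute (dpm) on the calibration strips; the function names and all numbers are hypothetical and are not taken from the Quest II software, whose internal procedure is not described here.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def spot_ppm(spot_od, strip_ods, strip_dpms, total_dpm_loaded):
    """Convert a spot's optical density to parts per million of the loaded sample.

    strip_ods / strip_dpms: calibration strips of known radioactivity
    exposed to the same film (hypothetical values supplied by the caller).
    """
    slope, intercept = fit_line(strip_ods, strip_dpms)
    spot_dpm = slope * spot_od + intercept
    return 1e6 * spot_dpm / total_dpm_loaded


# Hypothetical calibration strips and sample load.
ppm = spot_ppm(0.3, [0.1, 0.2, 0.4], [100.0, 200.0, 400.0], 1_000_000.0)
```

The sketch treats each spot as a single optical-density reading; integrating density over the spot area, as the text implies, would replace `spot_od` with a summed value.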

After the advent of phosphorimaging, gels bearing 35S-labeled proteins were
exposed to phosphorimager screens and scanned by a Fuji phosphorimager,
typically for two exposures per gel. Calibration strips of known radioactivity were
exposed simultaneously. Scan data from the phosphorimager was assimilated by
Quest II software, and quantitative data were recorded for the spots on the gels.

Measurements of protein turnover. Cells in exponential phase were pulse-labeled
with [35S]methionine, excess cold Met and Cys were added, and samples
of equal volume were taken from the culture at intervals up to 90 min (in one
experiment) or up to 160 min (in a second experiment). Incorporation of 35S into
protein was essentially 100% by the first sample (10 min). Extracts were made,
and equal fractions of the samples were loaded on 2D gels (i.e., the different
samples had different amounts of protein but equal amounts of 35S). Spots were
quantitated with a phosphorimager and Quest software.

The software was queried for spots whose radioactivity decreased through the
time course. The algorithm examined all data points for all spots, drew a best-fit
line through the data points, and looked for spots where this line had a statistically
significant negative slope. In one of the experiments, there was one such
spot. To the eye, this was a minor, unidentified spot seen only in the first two
samples (10 and 20 min). In the other experiment, the Quest software found no
spots meeting the criteria. Therefore, we concluded that none of the identified
spots (and all but one of the visible spots) represented proteins with long
half-lives.
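The slope test described above can be sketched as follows. This is an illustrative reconstruction, not the Quest algorithm: it fits a least-squares line to one spot's time course and compares the t statistic of the slope with a one-sided critical value that the caller must supply (for example from a t table for n - 2 degrees of freedom). The function names and the sample data are hypothetical.

```python
import math


def slope_t_statistic(times, values):
    """Return (slope, t statistic) of a least-squares line through the points."""
    n = len(times)
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    slope = sxy / sxx
    intercept = mean_v - slope * mean_t
    rss = sum((v - (intercept + slope * t)) ** 2 for t, v in zip(times, values))
    std_err = math.sqrt(rss / (n - 2) / sxx)
    if std_err == 0:
        # Perfect fit: flat line gives t = 0, otherwise the slope is "infinitely" significant.
        return slope, 0.0 if slope == 0 else math.copysign(math.inf, slope)
    return slope, slope / std_err


def is_decaying(times, values, t_critical):
    """True if the fitted slope is negative and exceeds the chosen critical value."""
    _, t_stat = slope_t_statistic(times, values)
    return t_stat < -t_critical


# Hypothetical time course (min) and spot radioactivity; 2.353 is the
# one-sided 5% critical value of Student's t for 3 degrees of freedom.
times = [10, 20, 30, 60, 90]
stable_spot = [100.0, 101.0, 99.0, 100.5, 99.5]
decaying_spot = [100.0, 90.5, 79.5, 50.2, 20.1]
flags = (is_decaying(times, stable_spot, 2.353), is_decaying(times, decaying_spot, 2.353))
```

With thousands of spots tested at once, a stricter threshold or a multiple-testing correction would be a reasonable refinement; the source does not say which criterion was used.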
Centrifugal fractionation. Cells were labeled, harvested, and broken with glass
beads by the standard method described above except that no detergent (i.e., no
deoxycholate) was present in the lysis buffer. The crude lysate was cleared of
unbroken cells and large debris by centrifugation at 300 × g for 30 s. The
supernatant of this centrifugation was then spun at 16,000 × g for 10 min to give
the pellet used for Fig. 6B. The supernatant of the 16,000 × g, 10-min spin was
then spun at 100,000 × g for 30 min to give the supernatant used for Fig. 6A.

Protein abundance calculations. A haploid yeast cell contains about 4 × 10^-12
g of protein (1, 15). Assuming a mean protein mass of 50 kDa, there are about
50 × 10^6 molecules of protein per cell. There are about 1.8 methionines per 10
kDa of protein mass, which implies 4.5 × 10^8 molecules of methionine per cell
(neglecting the small pool of free Met). We measured (i) the counts per minute
in each spot on the 2D gels, (ii) the total number of counts on each gel (by
integrating counts over the entire gel), and (iii) the total number of counts
loaded on the gel (by scintillation counting of the original sample). Thus, we
know what fraction of the total incorporated radioactivity is present in each spot.
After correcting for the methionine (and cysteine [see below]) content of each
protein, we calculated an absolute number of protein molecules based on the
fraction of radioactivity in each spot and on 50 × 10^6 total molecules per cell.

The labeling mixture used contained about one-fifth as much radioactive
cysteine as radioactive methionine. Therefore, the number of cysteine molecules
per protein was also taken into account in calculating the number of molecules
of protein, but Cys molecules were weighted one-fifth as heavily as Met molecules.
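The arithmetic in this calculation can be checked with a short sketch. The constants are the values stated in the text (4 × 10^-12 g of protein per cell, a 50-kDa mean protein mass, 1.8 methionines per 10 kDa, and Cys weighted at one-fifth of Met). The correction formula itself is an assumption about how the Met and Cys content was applied, and the average Cys content per protein is not given in the text, so it is a required argument here.

```python
AVOGADRO = 6.022e23
PROTEIN_PER_CELL_G = 4e-12       # grams of protein per haploid cell (from the text)
MEAN_PROTEIN_MASS_DA = 50_000    # assumed mean protein mass (from the text)
MET_PER_10_KDA = 1.8             # methionines per 10 kDa (from the text)
CYS_WEIGHT = 0.2                 # label carried roughly one-fifth as much Cys as Met


def total_molecules_per_cell():
    """Protein molecules per cell implied by the mass figures (about 50 million)."""
    return PROTEIN_PER_CELL_G / MEAN_PROTEIN_MASS_DA * AVOGADRO


def mean_met_per_protein():
    """Methionines in an average 50-kDa protein."""
    return MET_PER_10_KDA * MEAN_PROTEIN_MASS_DA / 10_000


def molecules_per_cell(fraction_of_label, n_met, n_cys, mean_cys, total=50e6):
    """Estimate copies per cell of one protein from its share of the radioactivity.

    Assumed correction: a protein carrying more label per molecule than the
    average protein needs fewer molecules to account for the same signal.
    mean_cys is the average Cys count per protein (hypothetical input).
    """
    label_per_molecule = n_met + CYS_WEIGHT * n_cys
    mean_label = mean_met_per_protein() + CYS_WEIGHT * mean_cys
    return fraction_of_label * total * mean_label / label_per_molecule


total = total_molecules_per_cell()                  # roughly 4.8e7
met_per_cell = mean_met_per_protein() * 50e6        # 4.5e8
copies = molecules_per_cell(0.001, n_met=9, n_cys=0, mean_cys=0.0)
```

A protein with no methionine and no cysteine would give a zero divisor, which matches the "No Met" entries in Table 1: such proteins cannot be quantified by this labeling.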

mRNA abundance calculations. For estimation of mRNA abundance, we used
SAGE (serial analysis of gene expression) data (27) and Affymetrix chip
hybridization data (29a, 30). The mRNA column in Table 1 shows mRNA abundance
calculated from SAGE data alone. However, the SAGE data came from cells
growing in YEPD medium, whereas our protein measurements were from cells
growing in YNB medium. In addition, SAGE data for low-abundance mRNAs
suffers from statistical variation. Therefore, we also used chip hybridization data
(29a, 30) for mRNA from cells grown in YNB. These hybridization data also had
disadvantages. First, the amounts of high-abundance mRNAs were systematically
underestimated, probably because of saturation in the hybridizations, which
used 10 µg of cRNA. For example, the abundance of ADH1 mRNA was 197
copies per cell by SAGE but only 32 copies per cell by hybridization, and the
abundance of ENO2 mRNA was 248 copies per cell by SAGE but only 41 by
hybridization. When the amount of cRNA used in the hybridization was reduced
to 1 µg, the apparent amounts of mRNA were similar to the amounts determined
by SAGE (29a, 29b). However, experiments using 1 µg of cRNA have been done
for only some genes (29a). Because amounts of mRNA were normalized to
15,000 per cell, and because the amounts of abundant mRNAs were underestimated,
there is a 2.2-fold overestimate of the abundance of nonabundant
mRNAs. We calculated this factor of 2.2 by adding together the number of
mRNA molecules from a large number of genes expressed at a low level for both
SAGE data and hybridization data. The sum for the same genes from hybridization
data is 2.2-fold greater than that from SAGE data.

To take into account these difficulties, we compiled a list of "adjusted" mRNA
abundance as follows. For all high-abundance mRNAs of our identified proteins,
we used SAGE data. For all of these particular mRNAs, chip hybridization
suggested that mRNA abundance was the same in YEPD and YNB media. For
medium-abundance mRNAs, SAGE data were used, but when hybridization
data showed a significant difference between YEPD and YNB, then the SAGE
data were adjusted by the appropriate factor. Finally, for low-abundance
mRNAs, we used data from chip hybridizations from YNB medium but divided
by 2.2 to normalize to the SAGE results. These calculations were completed
without reference to protein abundance.
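The adjustment rules above can be written out as a small decision function. This is a sketch under stated assumptions: the text does not give the thresholds separating high-, medium-, and low-abundance mRNAs or the test for a "significant difference", so the caller supplies the class and the flag, and the "appropriate factor" is assumed here to be the YNB/YEPD ratio of the hybridization signals. All names are hypothetical.

```python
HYBRIDIZATION_OVERESTIMATE = 2.2  # factor stated in the text for low-abundance mRNAs


def adjusted_mrna(abundance_class, sage=None, chip_ynb=None, chip_yepd=None,
                  media_differ=False):
    """Return an 'adjusted' mRNA abundance (copies per cell).

    abundance_class: "high", "medium", or "low" (assigned by the caller).
    sage: SAGE estimate (cells grown in YEPD).
    chip_ynb / chip_yepd: hybridization estimates in the two media.
    media_differ: True when hybridization shows a significant YEPD/YNB difference.
    """
    if abundance_class == "high":
        return sage
    if abundance_class == "medium":
        if media_differ:
            # Assumed form of the "appropriate factor": scale SAGE by the media ratio.
            return sage * chip_ynb / chip_yepd
        return sage
    if abundance_class == "low":
        return chip_ynb / HYBRIDIZATION_OVERESTIMATE
    raise ValueError(f"unknown abundance class: {abundance_class!r}")


high = adjusted_mrna("high", sage=197)                       # SAGE value kept as is
low = adjusted_mrna("low", chip_ynb=4.4)                     # hybridization / 2.2
medium = adjusted_mrna("medium", sage=10, chip_ynb=6, chip_yepd=3, media_differ=True)
```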

CAI. The codon adaptation index (CAI) was taken from the yeast proteome
database (YPD) (13), for which calculations were made according to Sharp and
Li (24). Briefly, the index uses a reference set of highly expressed genes to assign
a value to each codon, and then a score for a gene is calculated from the
frequency of use of the various codons in that gene (24).
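As an illustration of the index just described, the sketch below follows the usual formulation of the CAI: each codon receives a relative adaptiveness equal to its count in the reference set divided by the count of the most-used synonymous codon, and the score of a gene is the geometric mean of those values over its codons. The reference counts are invented toy numbers for two amino acids only, and codons absent from the table (for example Met, Trp, and stop codons) are skipped; the YPD values cited in the text were not computed with this code.

```python
import math


def relative_adaptiveness(reference_counts):
    """Map each codon to count / (count of the most-used synonymous codon).

    reference_counts: {amino_acid: {codon: count}} from highly expressed genes.
    """
    weights = {}
    for codon_counts in reference_counts.values():
        top = max(codon_counts.values())
        for codon, count in codon_counts.items():
            weights[codon] = count / top
    return weights


def cai(coding_sequence, weights):
    """Geometric mean of codon weights over the codons of a gene."""
    usable = len(coding_sequence) - len(coding_sequence) % 3
    codons = [coding_sequence[i:i + 3] for i in range(0, usable, 3)]
    logs = [math.log(weights[c]) for c in codons if c in weights]
    return math.exp(sum(logs) / len(logs))


# Hypothetical reference counts (toy data, nonzero by construction).
toy_reference = {
    "Lys": {"AAG": 80, "AAA": 20},
    "Glu": {"GAA": 90, "GAG": 10},
}
w = relative_adaptiveness(toy_reference)
preferred = cai("AAGGAA", w)   # only preferred codons
mixed = cai("AAAGAA", w)       # one rare codon lowers the score
```

A codon with a zero count in the reference set would make the logarithm undefined; real implementations substitute a small pseudo-count, which this sketch omits.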

Statistical analysis. The JMP program was used with the aid of T. Tully. The
JMP program showed that neither mRNA nor protein abundances were normally
distributed; therefore, Spearman rank correlation coefficients (r_s) were
calculated. The mRNA (adjusted and unadjusted) and protein data were also
transformed so that Pearson product-moment correlation coefficients (r_p) could
be calculated. First, this was done by a Box-Cox transformation of log-transformed
data. This transformation produced normal distributions, and an r_p of
0.76 was achieved. However, because the Box-Cox transformation is complex, we
also did a simpler logarithmic transformation. This produced a normal distribution
for the protein data. However, the distribution for the mRNA and adjusted
mRNA data was close to, but not quite, normal. Nevertheless, we calculated the
r_p and found that it was 0.76, identical to the coefficient from the Box-Cox
transformed data. We therefore believe that this correlation coefficient is not
misleading, despite the fact that the log(mRNA) distribution is not quite normal.
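The two coefficients named above can be computed with a few lines of plain Python. This is a generic sketch, not the JMP procedure: the Spearman coefficient is taken as the Pearson coefficient of average ranks, and the Pearson coefficient on transformed data uses a base-10 logarithm, so every value must be positive (an mRNA count of 0 would need separate handling). The Box-Cox step is not reproduced, and the sample values are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson product-moment correlation coefficient."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman rank correlation coefficient (r_s)."""
    return pearson(ranks(xs), ranks(ys))


def log_pearson(xs, ys):
    """Pearson coefficient (r_p) on log10-transformed, strictly positive data."""
    return pearson([math.log10(x) for x in xs], [math.log10(y) for y in ys])


# Hypothetical mRNA copies per cell and protein molecules (thousands) per cell.
mrna = [2.0, 5.0, 30.0, 54.0, 197.0]
protein = [14.0, 63.0, 70.0, 205.0, 1230.0]
r_s = spearman(mrna, protein)
r_p = log_pearson(mrna, protein)
```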



RESULTS 

Visualization of 1,400 spots on three gel systems. Yeast
proteins have isoelectric points ranging from 3.1 to 12.8, and
masses ranging from less than 10 kDa to 470 kDa. It is difficult
to examine all proteins on a single kind of gel, because a gel
with the needed range in pI and mass would give poor resolution
of the thousands of spots in the central region of the gel.
Therefore, we have used three gel systems: (i) pH "4 to 8" with
10% polyacrylamide; (ii) pH "3 to 10" with 10% polyacrylamide;
and (iii) nonequilibrium with 15% polyacrylamide (7,
8). Each gel system allows good resolution of a subset of yeast
proteins.

Figure 1 shows a pH 4-8, 10% polyacrylamide gel. The pH
at the basic end of the isoelectric focusing gel cannot be main- 
tained throughout focusing, and so the proteins resolved on 
such gels have isoelectric points between pH 4 and pH 6.7. For 
these pH 4-8 gels, we see 600 to 900 spots on the best gels after 
multiple exposures. 

The pH 3-10 gels (not shown) extend the pI range somewhat
beyond pH 7.5, allowing detection of several hundred addi- 
tional spots. Finally, we use nonequilibrium gels with 15% 
acrylamide in the second dimension. These allow visualization 
of about 100 very basic proteins and about 170 small proteins 
(less than 20 kDa). In total, using all three gel systems, about 
1,400 spots can be seen. These represent about 1,200 different 
proteins, which is about one-quarter to one-third of the proteins
expressed under these conditions (27, 30). Here, we focus
on the proteins seen on the pH 4-8 gels. 

Although nearly all expressed proteins are present on these 
gels, the number seen is limited by a problem we call coverage. 
Since there are thousands of proteins on each gel, many proteins
comigrate or nearly comigrate. When two proteins are
resolved, but are close together, and one protein spot is much
more intense than the other, a problem arises in visualizing the 
weaker spot: at long exposures when the weak signal is strong 
enough for detection, the signal from the strong spot spreads 
and covers the signal from the weaker spot. Thus, weak spots 
can be seen only when they are well separated from strong
spots. 

For a given gel, the number of detectable spots initially rises
with exposure time. However, beyond an optimal exposure, the
number of distinguishable spots begins to decrease, because
signals from strong spots cover signals from nearby weak spots.
At long exposures, the whole autoradiogram turns black. Thus,
there is an optimum exposure yielding the maximum number
of spots, and at this exposure the weakest spots are not seen.

Largely because of the problem of coverage, the proteins
seen are strongly biased toward abundant proteins. All identified
proteins have a CAI of 0.18 or more, and we have identified
no transcription factors or protein kinases, which are
nonabundant proteins. Thus, this technology is useful for examining
protein synthesis, amino acid metabolism, and glycolysis
but not for examining transcription, DNA replication, or
the cell cycle.

Spot identification. The identification of various spots has
been described elsewhere (7, 8). At present, 169 different spots 
representing 148 proteins have been identified. Many of these 
spots have been independently identified (2, 10, 23, 25). The 
main methods used in spot identification have been analysis of 
amino acid composition, gene overexpression, peptide se- 
quencing, and mass spectrometry. 

Pulse-chase experiments and protein turnover. Pulse-chase 
experiments were done to measure protein half-lives (Materi- 
als and Methods). Cells were labeled with [ 35 S]methionine for 
10 min, and then an excess of unlabeled methionine was added.
Samples were taken at 0, 10, 20, 30, 60, and 90 min after the 
beginning of the chase. Equal amounts of 35 S were loaded from 
each sample; 2D gels were run, and spots were quantitated. 
Surprisingly, almost every spot was nearly constant in amount 
of radioactivity over the entire time course (not shown). A few 
spots shifted from one position to another because of posttranslational
modifications (e.g., phosphorylation of Rpa0 and
Efb1). Thus, the proteins being visualized are all or nearly all
very stable proteins, with half-lives of more than 90 min. Gygi
et al. (10) have come to a similar conclusion by using the N-end
rule to predict protein half-lives. This result does not imply
that all yeast proteins are stable. The proteins being visualized 
are abundant proteins; this is partly because they are stable 
proteins. 

Protein quantitation. Because all of the proteins seen had 
effectively the same half-life, the abundance of each protein 
was directly proportional to the amount of radioactivity incor- 
porated during labeling. Thus, after taking into account the 
total number of protein molecules per cell, the average content 
of methionine and cysteine, and the methionine and cysteine
content of each identified protein, we could calculate the abun- 
dance of each identified protein (Tables 1 and 2; Materials and 
Methods). About 1,000 unidentified proteins were also quantified,
assuming an average content of Met and Cys.

Many proteins give multiple spots (7, 8). The contribution 
from each spot was summed to give the total protein amount. 
However, many proteins probably have minor spots that we are 
not aware of, causing the amount of protein to be underesti- 
mated. 

When the proteins on a pH 4-8 gel were ordered by abun- 
dance, the most abundant protein had 8,904 ppm, the 10th 
most abundant had 2,842 ppm, the 100th most abundant had 
314 ppm, the 500th most abundant had 57 ppm, and the 
1,000th most abundant (visualized at greater than optimum
exposure) had 23 ppm. Thus, there is more than a 300-fold
range in abundance among the visualized proteins. The most
abundant 10 proteins account for about 25% of the total protein
on the pH 4-8 gel, the most abundant 60 proteins account
for 50%, and the most abundant 500 proteins account for 80%.
Since it seems likely that the pH 4-8 gels give a representative 
sampling of all proteins, we estimate that half of the total 
cellular protein is accounted for by fewer than 100 different 
gene products, principally glycolytic enzymes and proteins in- 
volved in protein synthesis. 

Correlation of protein abundance with mRNA abundance. 
Estimates of mRNA abundance for each gene have been made
by SAGE (27) and by hybridization of cRNA to oligonucleo- 
tide arrays (30). These two methods give broadly similar re- 
sults, yet each method has strengths and weaknesses (Materials 
and Methods). Table 1 lists the number of molecules of mRNA
per cell for each gene studied. One measurement (mRNA) 
uses data from SAGE analysis alone (27); a second incorpo- 
rates data from both SAGE and hybridization (30) (adjusted 
mRNA) (Table 1; Materials and Methods). We correlated
protein abundance with mRNA abundance (Fig. 2). For adjusted
mRNA versus protein, the Spearman rank correlation
coefficient, r_s, was 0.74 (P < 0.0001), and the Pearson correlation
coefficient, r_p, on log transformed data (Materials and
Methods) was 0.76 (P < 0.00001). We obtained similar correlations
for mRNA versus protein and also for other data transformations
(Materials and Methods). Thus, several statistical
methods show a strong and significant correlation between 
mRNA abundance and protein abundance. Of course, the cor- 
relation is far from perfect; for mRNAs of a given abundance, 
there is at least a 10-fold range of protein abundance (Fig. 2). 
Some of this scatter is probably due to posttranscriptional 
regulation, and some is due to errors in the mRNA or protein 
data. For example, the protein Yef3 runs poorly on our gels, 
giving multiple smeared spots. Its abundance has probably 
been underestimated, partly explaining the low protein/mRNA 
ratio of Yef3. It is the most extreme outlier in Fig. 2. 

These data on mRNA (27, 30) and protein abundance (Ta- 
ble 1) suggest that for each mRNA molecule, there are on 
average 4,000 molecules of the cognate protein. For instance, 
for Actl (actin) there are about 54 molecules of mRNA per 
cell and about 205,000 molecules of protein. Assuming an 
mRNA half-life of 30 min (12) and a cell doubling time of 120 
min, this suggests that an individual molecule of mRNA might 
be translated roughly 1,000 times. These calculations are lim- 
ited to mRNAs for abundant proteins, which are likely to be 
the mRNAs that are translated best. 

A full complement of cell protein is synthesized in about 120
min under these conditions. Thus, 4,000 molecules of protein
per molecule of mRNA implies that translation initiates on an
mRNA about once every 2 s. This is a remarkably high rate; it
implies that if an average mRNA bears 10 ribosomes engaged
in translation, then each ribosome completes translation in
20 s; if an average protein has 450 residues, this in turn implies
translation of over 20 amino acids per s, a rate considerably
higher than estimated for mammals (3 to 8 amino acids per
s) (18). These estimates depend on the amount of mRNA per
cell (11, 27).
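The chain of estimates in this paragraph can be reproduced with a few lines of arithmetic. The inputs are the figures stated in the text (4,000 proteins per mRNA, a 120-min synthesis time, 10 ribosomes per mRNA, and 450 residues per protein); the function name is hypothetical, and the relation "transit time = initiation interval × ribosomes per mRNA" assumes steady-state, evenly loaded mRNAs.

```python
def translation_estimates(proteins_per_mrna=4000, synthesis_time_min=120,
                          ribosomes_per_mrna=10, residues_per_protein=450):
    """Return (seconds between initiations, seconds per ribosome transit,
    amino acids added per second per ribosome)."""
    total_seconds = synthesis_time_min * 60
    # Each mRNA must yield this many completed proteins in the available time.
    initiation_interval_s = total_seconds / proteins_per_mrna
    # With N ribosomes spaced one interval apart, a transit lasts N intervals.
    transit_time_s = initiation_interval_s * ribosomes_per_mrna
    elongation_rate = residues_per_protein / transit_time_s
    return initiation_interval_s, transit_time_s, elongation_rate


interval_s, transit_s, rate_aa_per_s = translation_estimates()
```

With the stated inputs the unrounded values are 1.8 s between initiations, 18 s per transit, and 25 amino acids per second, consistent with the rounded figures of about 2 s, 20 s, and "over 20" in the text.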

The large number of protein molecules that can be made 
from a single mRNA raises the issue of how abundance is 
controlled for less abundant proteins. Many nonabundant pro- 
teins may be unstable, and this would reduce the protein/ 
mRNA ratio. In addition, many nonabundant proteins may be 
translated at suboptimal rates. We have found that mRNAs for 
nonabundant proteins usually have suboptimal contexts for 
translational initiation. For example, there are over 600 yeast
genes which probably have short open reading frames in the 
mRNA upstream of the main open reading frame (17a). These 
may be devices for reducing the amount of protein made from 
a molecule of mRNA. 

Correlation of codon bias with protein abundance. The
mRNAs for highly expressed proteins preferentially use some 
codons rather than others specifying the same amino acid (14). 
This preference is called codon bias. The codons preferred are 
those for which the tRNAs are present in the greatest amounts. 
Use of these codons may make translation faster or more 
efficient and may decrease misincorporation. These effects are 
most important for the cell for abundant proteins, and so 
codon bias is most extreme for abundant proteins. The effect 
can be dramatic— highly biased mRNAs may use only 25 of the 
61 codons. 

We asked whether the correlation of codon bias with abundance
continues for medium-abundance proteins. There are
various mathematical expressions quantifying codon bias; here,
we have used the CAI (24) (Materials and Methods) because
it gives a result between 0 and 1. The r_s for CAI versus protein
abundance is 0.80 (P < 0.0001), similar to the mRNA-protein



TABLE 1. Quantitative data^a

Function    Name    CAI    mRNA    Adjusted mRNA    Protein (Glu) (10^3)    Protein (Eth) (10^3)    E/G ratio


197 


197 


3,230 


972 


0.79 


0 


2.8 


0 


963 • 


>20 


1 


23 


288' 


12 


INO JvlG 




410 


974 . 


2.4 


248 


248 


650 


215 


0.33 


179 


179 . 


640 


608 


0.95 


33 


10.5 


62 


46 


0 




0 


671 


>20 


5 


5 


43 


33 


226 


226 


280 


205 


0.73 . 


5 


5 


75 


53 


0.71 


14 


14 


160 


120 


0.75 


3 


0.7 


37 


34 


5 


5 


no 


35 




63 


63 


430 


876 


NR 


460 


460 


1,670 


3,927 


NR 


NoAVtf 




No Met 


No Met 



Carbohydrate metabolism 



Protein synthesis 



Heat shock 



Amino acid synthesis 



Miscellaneous 



Name      CAI
Adh1      0.810
Adh2      0.304
Cit2      0.185
Eno1      0.870
Eno2      0.892
Fba1      0.868
Hxk1,2    0.300
Icl1      0.253
Pdb1      0.342
Pdc1      0.903
Pfk1      0.465
Pgi1      0.681
Pyc1      0.260
Tal1      0.579
Tdh2      0.904
Tdh3      0.924
Tpi1      0.817



Name      CAI      mRNA     Adjusted mRNA
Efb1      0.762    33       36.5
Eft1,2    0.801    26       26
Prt1      0.303    4        0.7
Rpa0      0.793    246      246
Tif1,2    0.752    29       29
Yef3      0.777    36       36
Hsc82     0.581    2        2.9
Hsp60     0.381    9        2.3
Hsp82     0.517    2        1.3
Hsp104    0.304    7        7
Kar2      0.439    5        10.1
Ssa1      0.709    2        43
Ssa2      0.802    30       5
Ssb1,2    0.850    50       50
Ssc1      0.521    2        2.6
Sse1      0.521    8        8
Sti1      0.247    1        1.3
Ade1      0.229    4        4
Ade3      0.276    2        3.7
Ade5,7    0.257    2        1.4
Arg4      0.229    3        8.1
Gdh1      0.585    30       27*
Gln1      0.524    11       13
His4      0.267    3        3
Ilv5      0.801    6        6
Lys9      0.332    4        4
Met6      0.657             22
Pro2      0.248    3        3
Ser1      0.258    2        \2
Trp5      0.319    5        5
Act1      0.710    54       54
Adk1      0.531    No Ma
Ald6      0.520    3        3
Atp2      0.424    1        4.1
Bmh1      0.322    46       46
Bmh2      0.384    1        3.4
Cdc48     0.306    2        2.4
Cdc60     0.299    2        0.86
Erg20     0.373    5        5
Gpp1      0.603    36       5
Gsp1      0.623    3        3
Ipp1      0.620    4        4
Lcb1      0.173    0.3      0.8
Mol1      0.423    0        0.45
Pab1      0.488    3        3
Psa1      0.600    15       15
Rnr4      0.497    6        6
Sam1      0.494    5        5
Sam2      0.497    3        35
Sod1      0.376    36       36
Uba1      0.212    2        2
YKL056    0.731    62       62
YLR309    0.549    21       21
YMR116    0.777    41       41



" CAI. 3 measure of codon bias, i 



358 
99 
32 
277 
233 
34 

132 

35 
52 
70' 
43 
303 
233 
270 
68 
96 
25 

14 
12 
14 
43 

148 
77 
• 35 

152 
32 

190 
30 
15 
28 

205 
47 

381 
76 

391 

134 

32 
6 

92 
234 

115 
254 

19 

20 

41 
148 

44 

59 

63 
631 

34 
253 
930 
184 



362 
54 
6 

300 
306 
ND 

75 
82 
335 
361 
102 
421 
324 
85 
80 
48 
44 

27 
9 
4 
41 
55 
304 
23 
109 
17 
80 
. J2 
8 
12 

164 
43 
159 
109 
137 
147 
26 
2 
39 
158 
39 ■ 
147 
40 
16 
19 
56 
37 
21 
20 
638 
20 
112 

40 



0.55 

036 
0.46 



0.67 

2.3 

2.6 

2.3 

2.4 

1.4 

1.5 

1.2 

1.7 



1.3 
3.5 

0.7 

0.52 

0.42 



0.78 



1.4 
0.72 



0.34 
0.58 



0.47 



0.44 
0.20 
TABLE 2. Functions of proteins listed in Table 1

Name^a    YPD title lines^b



Adh1
Adh2
Cit2
Eno1
Eno2
Fba1
Hxk1
Hxk2
Icl1
Pdb1
Pdc1
Pfk1
Pgi1
Pyc1
Tal1
Tdh2
Tdh3
Tpi1

Efb1
Eft1
Eft2
Prt1
Rpa0 (RPP0)
Tif1
Tif2
Yef3

Hsc82
Hsp60
Hsp82
Hsp104
Kar2
Ssa1
Ssa2
Ssb1
Ssb2
Ssc1
Sse1
Sti1

Ade1
Ade3
Ade5,7
Arg4
Gdh1
Gln1
His4
Ilv5
Lys9
Met6
Pro2
Ser1
Trp5

Act1
Adk1
Ald6
Atp2
Bmh1
Bmh2
Cdc48
Cdc60
Erg20
Gpp1 (Rhr2)
Gsp1
Ipp1
Lcb1
Mol1 (Thi4)
Pab1
Psa1
Rnr4
Sam1
Sam2
Sod1
Uba1



Alcohol dehydrogenase I; cytoplasmic isozyme reducing acetaldehyde to ethanol, regenerating NAD+
Alcohol dehydrogenase II; oxidizes ethanol to acetaldehyde, glucose repressed
Citrate synthase, peroxisomal (nonmitochondrial); converts acetyl-CoA and oxaloacetate to citrate plus CoA
Enolase 1 (2-phosphoglycerate dehydratase); converts 2-phospho-D-glycerate to phosphoenolpyruvate in glycolysis
Enolase 2 (2-phosphoglycerate dehydratase); converts 2-phospho-D-glycerate to phosphoenolpyruvate in glycolysis
Fructose bisphosphate aldolase II; sixth step in glycolysis
Hexokinase I; converts hexoses to hexose phosphates in glycolysis; repressed by glucose
Hexokinase II; converts hexoses to hexose phosphates in glycolysis and plays a regulatory role in glucose repression
Isocitrate lyase, peroxisomal; carries out part of the glyoxylate cycle; required for gluconeogenesis
Pyruvate dehydrogenase complex, E1 beta subunit
Pyruvate decarboxylase isozyme 1
Phosphofructokinase alpha subunit, part of a complex with Pfk2p which carries out a key regulatory step in glycolysis
Glucose-6-phosphate isomerase, converts glucose-6-phosphate to fructose-6-phosphate
Pyruvate carboxylase 1; converts pyruvate to oxaloacetate for gluconeogenesis
Transaldolase; component of nonoxidative part of pentose phosphate pathway
Glyceraldehyde-3-phosphate dehydrogenase 2; converts D-glyceraldehyde 3-phosphate to 1,3-diphosphoglycerate
Glyceraldehyde-3-phosphate dehydrogenase 3; converts D-glyceraldehyde 3-phosphate to 1,3-diphosphoglycerate
Triosephosphate isomerase; interconverts glyceraldehyde-3-phosphate and dihydroxyacetone phosphate

Translation elongation factor EF-1β; GDP/GTP exchange factor for Tef1p/Tef2p
Translation elongation factor EF-2; contains diphthamide which is not essential for activity; identical to Eft2p
Translation elongation factor EF-2; contains diphthamide which is not essential for activity; identical to Eft1p
Translation initiation factor eIF3 beta subunit (p90); has an RNA recognition domain
Acidic ribosomal protein A0
Translation initiation factor 4A (eIF4A) of the DEAD box family
Translation initiation factor 4A (eIF4A) of the DEAD box family
Translation elongation factor EF-3A; member of ATP-binding cassette superfamily

Chaperonin homologous to E. coli HtpG and mammalian HSP90
Mitochondrial chaperonin that cooperates with Hsp10p; homolog of E. coli GroEL
Heat-inducible chaperonin homologous to E. coli HtpG and mammalian HSP90

H 7i^} P ^^T^ iW ,ndUCeiJ thermoto,erancc and to "solubilizing aggregates of denatured proteins; important for [psi"* 
7»d fof^^ "° ,ei » -ross the endoplasmic reticulum membrane 

Cytoplasmic chaperone; heat shock protein of the HSP70 family 
Cytoplasmic chaperone; member of the HSP70 family 
Heal sbock protein of HSP70 familv involved in the translational apparatus 
Heal shock pro lein of HSP70 family, cytoplasmic 

M ^£JSJ?£S ^^t^J^^ Tim44p 3nd p,ays 3 ch3pcronin ro,e in receivin * anJ of p- Iei " 

*, C r?l^°At P T C ™, (?f thC " S 7? fan,) ^ ; n ? u,,ico Py suppressor of mutants with hype ractiva led Ras/cvclic AMP pathway 
S.ress-.nduced protein .eou.red for optimal growth nt high and low temperat.,re; has lelralricopeptide repeats P * 

P^hosphciribosylamidoimida2ole-succinoc a rbo M mide synth:ise: catalyzes the seventh slep in de novo purine biosynthesis pathway 
C, letrahydrofolate synthase (Afunctional enzyme), cytoplasmic H oiosymnesrs patnway 

Phosphoribosylammeglycine ligase plus phosphoribosylformvlglvcinamidine evelo- ligase; bifunctionnl orolein 
Argjnmosuccinale lyase: caiafyzes the final slep in arginine biosynthesis * - P 
Glutamic dehydrogenase <NADP + ); combines ammonia and a-ketoglutarale to form ulutamale 
Otuiamme synthetase; combines ammonia lo ghilamate in ATP-driven reaction 

£?ft^^ ppophosphohydrolase/liistidinol dehydrogenase; 2nd, 3rd, and 10th steps of 

KC anli m J hio^n 3Cid red «'°*>""«»«> <a>ph-ke.o- P .hydioxy ( acyl) reductoisomerase); second step in Val ' 

Sacchar opine dehydrogenase (NADP*, i.-glutamale forming) (sacchuropine reductase), seventh Step in lysine biosvnlhesis nathwav 

J- Glutamyl phosphate leduclase fphosphoglutarnate dehydrogenase), proline biosynlhelic ei>7 V me 
Phosphoscnne transaminase; involved in synthesis of serine from 3-phospboglycerate ' 
Jryptophan synthase, last (5lh) step in tryptophan biosynthesis pathway 

Aciin; involved in cell polarization, endocylosis, and other cvloskelelal functions 
Adenylate kinase (GTP:AMP phosphotransferase), cytoplasmic 
Cytosohc acet aldehyde dehydrogenase 

Beta subunii of Fl 'ATP synthase; 3 copies are found in each Fl oligomer 
Homolog of mammalian 14-3-3 prolein; has strong similarity to Bmh2p 
Homolog of mammalian 14-3-3 protein; has strong similaritv to Bmhlp 

^^^x^ssr required fo1 cc " diCision and homox ^ memb — ^ 

^^lolX^^^i may bc ru,e - ,imi,ine s,ep in hi »^ lh ™ 

Ran, a Gl P -binding pioiein of the Ras M.perfamily involved in trafficking through nuclear pores 
Inorganic pyrophosphatase, cytoplasmic 5 y 

Component of serine C ^hnitoyltransferase; first step in biosynthesis of long-chain base component of sphingoids 
Thj a m,n feJBed pTO(cm cswn|ia , (o , gfOW|h |he pf |man)jne - mponent ol spti.ngol.pids 

lecofcii^ °' CV ' f,p,aSrn nUCk,,S; ^ ° f ,hc 3 '- nJ «NA pMKessing complex (cleavage factor I); has 4 RNA 

Mannose- J -phosphate gu:myll f ansferasc; CDP-mannose pyrophosphory'lase 
Ribonucleotide leduclase small subunit 
5-AdenosyImelhionine synthetase 1 
A denosyl methionine synthetase 2 
Copper -zinc superoxide dismuiase 
Ubiquitin-:ictivatij)g (El) en2vme 



Y1 K R?0Q ( Ahnn Af^'r ljanshl }° n '^y controlled tumor protein of animal cells and higher plants 
YLKlUy (Anpj) Alkyl hydroperoxide reductase 

YMR116 (AscJ) Abundant protein with fffecis on translational efficiency and cell size, has two WD (WD-40) repeats 



^Accepted name from .he Savchorvmye,, genome d ; „ :t ha5e and YPD. Names in parentheses icpieseni recent chances 
Courtesy of Piolcnmc. Jnc. i cprinted with permission. 6 " 
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FIG. 2. Correlation of protein abundance with adjusted mRNA abundance. 
The number of molecules per cell of each protein is plotted against the number 
of molecules per cell of the cognate mRNA, with an r_p of 0.76. Note the
logarithmic axes. Data for mRNA were taken from references 27 and 30 and
combined as described in Materials and Methods.



correlation, confirming a strong correlation between CAI and
protein abundance (Fig. 3). The relationship between CAI and
protein abundance is log linear from about 1,000,000 to about
10,000 molecules per cell. We have no data for rarer proteins.

It is not clear whether CAI reflects maximum or average
levels of protein expression. The proteins used for the CAI-
protein correlation included some proteins which were not
expressed at maximum levels under the condition of the ex-
periment (Hsc82, Hsp104, Ssa1, Ade1, Arg4, His4, and others).
When these proteins were removed from consideration and
the correlation between CAI and the remaining (presumably
constitutive) proteins was recalculated, the r_s was essentially
unchanged (not shown).

The equation describing the graph in Fig. 3 is log (protein
molecules/cell) = (2.3 × CAI) + 3.7. Thus, under certain
conditions (a CAI of 0.3 or greater; a constitutively expressed
gene), a very rough estimate of protein abundance can be
made by raising 10 to the power of [(2.3 × CAI) + 3.7].
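
As an illustration of the relationship above, the following is a minimal Python sketch, assuming the logarithm is base 10 and that the fit applies only at a CAI of 0.3 or greater. The function name and the range check are hypothetical and are not part of the original study.

```python
def estimate_protein_abundance(cai: float) -> float:
    """Very rough estimate of protein molecules per cell from the CAI.

    Uses the fitted relationship log10(molecules/cell) = 2.3 * CAI + 3.7,
    which the text describes as applicable only for a CAI of 0.3 or
    greater and for constitutively expressed genes.
    """
    if not 0.3 <= cai <= 1.0:
        # Outside the range covered by the measured proteins (assumption:
        # the CAI is bounded above by 1.0).
        raise ValueError("fit is only described for 0.3 <= CAI <= 1.0")
    return 10 ** (2.3 * cai + 3.7)


if __name__ == "__main__":
    for cai in (0.3, 0.5, 0.8, 1.0):
        print(f"CAI {cai:.1f}: about {estimate_protein_abundance(cai):,.0f} molecules/cell")
```

With these inputs the estimate runs from a few tens of thousands of molecules per cell at a CAI of 0.3 to about 1,000,000 at a CAI of 1.0, which matches the log-linear range stated in the text.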

The distribution of CAI over the genome (Fig. 4) consists of
a lower, bell-shaped distribution, possibly indicating a region
where there is no selection for codon bias, and an upper, flat
distribution, starting at a CAI of about 0.3, possibly indicating 
a region where there is selection for codon bias. Almost all of 
the proteins whose abundance we have measured are in the 
upper, flat portion of the distribution. In the lower, bell-shaped 
region, we do not know whether there is a correlation between 
CAI and protein abundance. 

Changes in protein abundance in glucose and ethanol. A
comparison of cells grown in glucose (Fig. 1A) with cells grown
in ethanol (Fig. 1B) is shown in Table 1. As is well known,
some proteins are induced tremendously during growth on
ethanol. Two striking examples are the peroxisomal enzymes
Icl1 (isocitrate lyase) and Cit2 (citrate synthase), which are
induced in ethanol by more than 100- and 12-fold, respectively
(Fig. 1; Table 1). These enzymes are key components of the
glyoxylate shunt, which diverts some acetyl coenzyme A
(acetyl-CoA) from the tricarboxylic acid cycle to gluconeogen-
esis. S. cerevisiae requires large amounts of carbohydrate for its
cell wall; in ethanol medium, this carbohydrate comes from 
gluconeogenesis, which depends on the glyoxylate shunt and 
on the glycolytic pathway running in reverse. The need for
gluconeogenesis also explains why glycolytic enzymes are
abundant even in ethanol medium. Thus, 2D gel analysis shows
the prominence of the glycolytic and glyoxylate shunt enzymes
in cells grown on ethanol, emphasizing that gluconeogenesis,
presumably largely for production of the cell wall, is a major
metabolic activity under these conditions.

During gluconeogenesis, substrate-product relationships are 
reversed for the glycolytic enzymes. One might expect that not 
all glycolytic enzymes would be well adapted to the reverse 
reaction. Indeed, 2D gels show that in ethanol, Adh2 (alcohol
dehydrogenase 2) is strongly induced (16), while its isozyme
Adh1 is not greatly affected. Adh1 and Adh2 each interconvert
acetaldehyde and ethanol. Adh1 has a relatively high K_m for
ethanol (17 mM), while Adh2 has a lower K_m (0.8 mM) (5).
Thus, it is thought that Adh1 is specialized for glycolysis (ac-
etaldehyde to ethanol), while Adh2 is specialized for respira-
tion (ethanol to acetaldehyde) (5, 29). Similarly, Eno1 (enolase
1) is induced in ethanol, while its isozyme Eno2 (enolase 2)
decreases in abundance (Table 1) (4, 19). Eno1 is inhibited by
2-phosphoglycerate (the glycolytic substrate), while Eno2 is
inhibited by phosphoenolpyruvate (the gluconeogenic sub-
strate) (4). Perhaps Eno1 has a lower K_m for phosphoenol-
pyruvate than does Eno2, though to our knowledge this has not
been tested. Thus, the 2D gels distinguish isozymes specialized
for growth on glucose (Adh1 and Eno2) from isozymes spe-
cialized for ethanol (Adh2 and Eno1).

Many heat shock proteins (e.g., Hsp60, Hsp82, Hsp104, and
Kar2) were about twofold more abundant in ethanol medium
than in glucose medium. This is consistent with the increased
heat resistance of cells grown in ethanol (3).

Enzymes involved in protein synthesis (Eft1, Rpa0, and Tif1)
were about twice as abundant in glucose medium as in ethanol
medium. This may reflect the higher growth rate of the cells in
glucose.

Phosphorylation of proteins. To examine protein phosphor-
ylation, we labeled cells with 32P and ran 2D gels to examine
phosphoproteins. About 300 distinct spots, probably represent-
ing 150 to 200 proteins, could be seen on pH 4-8 gels (Fig. 5B).
We then aligned autoradiograms of three gels, each with a
different kind of labeled protein (32P only [Fig. 5B], 32P plus
35S [Fig. 5A], and 35S only [not shown, but see Fig. 1 for
example]). In this way, we made provisional identification of
FIG. 3. Correlation of protein abundance with CAI. The number of mole-
cules per cell of each protein is plotted against the CAI for that protein. Note the
logarithmic scale on the protein axis. Data for the CAI are from the YPD
database (13).
FIG. 4. Distribution of CAI over the whole genome, shown in intervals of 0.030 (i.e., there are 150 genes with a CAI between 0.000 and 0.030, inclusive; 31 genes
with a CAI between 0.031 and 0.060; 269 genes with a CAI between 0.061 and 0.090; 1,296 genes with a CAI between 0.091 and 0.120; etc.). The distribution peaks
with 2,028 genes with a CAI between 0.121 and 0.150.



some of the 32P-labeled spots as particular 35S-labeled spots.
All such identifications are somewhat uncertain, since precise
alignments are difficult, and of course multiple spots may ex-
actly comigrate. Nevertheless, we believe that most of the
provisional identifications are probably correct. Among the
major 32P-labeled proteins are the hexokinases Hxk1 and
Hxk2, the acidic ribosome-associated protein Rpa0, the trans-
lation factors Yef3 and Efb1, and probably Hsp70 heat shock
proteins of the Ssa and Ssb families. Rpa0 and Efb1 are quan-
titatively monophosphorylated.

Many yeast proteins resolve into multiple spots on these 2D 
gels (7). Yef3 has five or more spots, at least four of which 
comigrate with 32P. Tpi1 has a major spot showing no 32P
labeling and a minor, more acidic spot which overlaps with
some 32P label. Tif1 has at least seven spots (7); two of these
overlap with some 32P label, but five do not (Fig. 5). Eft1 has
at least three spots (7), and none of these overlap with 32P,
although there are three nearby, unidentified 32P-labeled spots
(a, c, and d in Fig. 5). Spots that seem to be extra forms of
Met6, Pdc1, Eno2, and Fba1 can be seen in Fig. 6A, but there
is little 32P at these positions in Fig. 5. Thus, phosphorylation
explains some but not all of the different protein isoforms seen.

The cell cycle is regulated in part by phosphorylation. We 
compared 32P-labeled proteins from cells synchronized in G1
with α-factor, in cells synchronized in G1 by depletion of G1
cyclins, and in cells synchronized in M phase with nocodazole. 
Only very minor differences were seen, and these were difficult 
to reproduce. The cell cycle proteins regulated by phosphory- 
lation may not be abundant enough for this technique to be 
applied easily. 

Centrifugal fractionation. We fractionated 35S-labeled ex-
tracts by centrifugation (Materials and Methods). Figure 6A
shows the proteins in the supernatant of a high-speed
(100,000 × g, 30 min) centrifugation, while Fig. 6B shows the
proteins in the pellet of a low-speed (16,000 × g, 30 min)
centrifugation. Many proteins are tremendously enriched in
one fraction or the other, while others are present in both.



Most glycolytic enzymes (e.g., Tdh2, Tdh3, Eno2, Pdc1, Adh1,
and Fba1) are enriched in the supernatant fraction. The only
exception is Pfk1 (not indicated), which is found in both pellet
and supernatant fractions. Many proteins involved in protein
synthesis (Eft1, Yef3, Prt1, Tif1, and Rpa0) are in the pellet,
possibly because of the association of ribosomes with the en-
doplasmic reticulum. However, Efb1 is in the supernatant, as is
a substantial portion of the Eft1. Perhaps surprisingly, several
mitochondrial proteins (Atp2 [not shown] and Ilv5) are largely
in the supernatant. Perhaps glass bead breakage of cells re-
leases mitochondrial proteins. The nuclear protein Gsp1 is in
the pellet fraction. The enrichment produced by centrifugation
makes it possible to see minor spots which are otherwise poorly
resolved from surrounding proteins. Figure 6B shows that the
previously identified Tif1 spot is surrounded by as many as six
other spots that cofractionate. We observed six identical or
very similar additional spots when we overexpressed Tif1 from
a high-copy-number plasmid (not shown). Signal overlaps only
one or two of these spots in 32P-labeling experiments (Fig. 5),
and so the different forms are not mainly due to different
phosphorylation states.

DISCUSSION

Our experience with developing a 2D gel protein database 
for S. cerevisiae is summarized here. With current technology,
we can see the most abundant 1,200 proteins, which is about 
one-third to one-quarter of the proteins expressed. The re- 
maining proteins will be difficult to see and study with the 
methods that we have used, not because of a lack of sensitivity 
but because weak spots are covered by nearby strong spots. 

Of the 1,200 proteins seen, we have identified 348, with a 
bias toward the most abundant proteins. Steady application of
the methods already used would allow identification of most of 
the remaining proteins. Gene overexpression will be particu- 
larly useful, since it is not affected by the lower abundance of 
the remaining visible proteins. 
FIG. 5. Phosphorylated proteins. (A) Mixture of 32P-labeled proteins and 35S-labeled proteins. Two separate labeling reactions were done, one with 32P and one
with 35S, and extracts were mixed and run on a 2D gel. Spots marked with numbers rather than gene names represent spots noted on 35S gels but unidentified. Spots
labeling with 32P were identified by (i) increased labeling compared to the 35S-only gel (not shown); (ii) the characteristic fuzziness of a 32P-labeled spot; and (iii) the
decay of signal intensity seen on exposures made 4 weeks later (not shown). A minor form of Tpi1 and at least six minor forms of Tif1 have been noted in overexpression
experiments (see also Fig. 6B); positions of the minor forms are indicated by circles. (B) 32P-only labeling. The major form of Tpi1 which was not labeled with 32P is
indicated by a large circle; positions of several forms of Tif1 are indicated by smaller circles.
FIG. 6. Fractionation by centrifugation. (A) Proteins in the supernatant of a 100,000 × g, 30-min spin; (B) proteins in the pellet of a 16,000 × g, 10-min spin. Supernatant
fractions examined in multiple experiments done over a wide range of g forces looked similar to each other, as did the pellet fractions.






Vol 19, 1999 

2D gels of the kind that we have used are not suitable for 
visualization of rare proteins. However, it will be possible to
study on a global basis metabolic processes involving relatively
abundant proteins, such as protein synthesis, glycolysis, glu-
coneogenesis, amino acid synthesis, cell wall synthesis, nucle-
otide synthesis, lipid metabolism, and the heat shock response. 

Gygi et al. (10) have recently completed a study similar to
ours. Despite generating broadly similar data, Gygi et al. 
reached markedly different conclusions. We believe that both 
mRNA abundance and codon bias are useful predictors of 
protein abundance. However, Gygi et al. feel that mRNA 
abundance is a poor predictor of protein abundance and that
"codon bias is not a predictor of either protein or mRNA
levels" (10). These different conclusions are partly a matter of
viewpoint. Gygi et al. focus on the fact that the correlations of 
mRNA and codon bias with protein abundance are far from 
perfect, while we focus on the fact that, considering the wide 
range of mRNA and protein abundance and the undoubted 
presence of other mechanisms affecting protein abundance, 
the correlations are quite good. 

However, the different conclusions are also partly due to
different methods of statistical analysis and to real differences
in data. With respect to statistics, Gygi et al. used the Pearson
product-moment correlation coefficient (r_p) to measure the
covariance of mRNA and protein abundance. Depending on
the subset of data included, their r_p values ranged from 0.1 to
0.94. Because of the low r_p values with some subsets of the
data, Gygi et al. concluded that the correlation of mRNA to
protein was poor. However, the r_p correlation is a parametric
statistic and so requires variates following a bivariate normal
distribution; that is, it would be valid only if both mRNA and
protein abundances were normally distributed. In fact, both
distributions are very far from normal (data not shown), and so
a calculation of r_p is inappropriate. There was no statistical
backing for the assertion that codon bias fails to predict pro-
tein abundance.

We have taken two statistical approaches. First, we have 
used the Spearman rank correlation coefficient (r_s). Since this
statistic is nonparametric, there is no requirement for the data
to be normally distributed. Using the r_s, we find that mRNA
abundance is well correlated with protein abundance (r_s =
0.74), and the CAI is also well correlated with protein abun-
dance (r_s = 0.80) (and also with mRNA abundance [data not
shown]). For the data of Gygi et al. (10), we obtained similar
results, though with their data the correlation is not as good: r_s
= 0.59 for the mRNA-to-protein correlation, and r_s = 0.59 for
the codon bias-to-protein correlation.
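
To illustrate why a rank-based statistic is insensitive to the shape of the distributions, the following is a minimal Python sketch that computes r_s as the Pearson coefficient of the ranks (with average ranks for ties). The mRNA and protein values are invented for illustration only and are not data from either study; the function names are likewise hypothetical.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson product-moment correlation coefficient (r_p)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman(x, y):
    """Spearman rank correlation coefficient (r_s): Pearson on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical, strongly skewed abundances (molecules per cell).
mrna = [0.5, 1.0, 2.0, 10.0, 50.0]
protein = [2e3, 5e3, 1e4, 2e5, 1e6]

if __name__ == "__main__":
    print("r_p on raw values:", pearson(mrna, protein))
    print("r_s on ranks:     ", spearman(mrna, protein))
```

Because the invented protein values increase monotonically with the invented mRNA values, r_s is exactly 1 here even though the raw relationship is not linear, whereas r_p on the untransformed values is below 1.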

In a second approach, we transformed the mRNA and pro- 
tein data to forms where they were normally distributed, to 
allow calculation of an r_p (Materials and Methods). Two trans-
formations, Box-Cox and logarithmic, were used; both gave
good correlations with our data [e.g., r_p = 0.76 for log(adjusted
RNA) to log(protein)]. We were not able to transform the data
of Gygi et al. to a normal distribution.

Finally, there are also some differences in data between the 
two studies. These may be partly due to the different measure-
ment techniques used: Gygi et al. measured protein abundance
by cutting spots out of gels and measuring the radioactivity in
each spot by scintillation counting, whereas we used phospho-
rimaging of intact gels coupled to image analysis. We com-
pared our data to theirs for the proteins common between the
studies (but excluding proteins whose mRNAs are known to
differ between rich and minimal media, and excluding Tif1,
which was anomalous in differing by 100-fold between the two
data sets). The r_s between the two protein data sets was 0.88
(P < 0.0001). Although this is a strong correlation, the fact that
it is less than 1.0 suggests that there may have been errors in
measuring protein abundance in one or both studies. After
normalizing the two data sets to assume the same amount of
protein per cell, we found a systematic tendency for the protein
abundance data of Gygi et al. to be slightly higher than ours for
the highest-abundance proteins and also for the lowest-abun-
dance proteins but slightly lower than ours for the middle-
abundance proteins. These systematic differences suggest some
systematic errors in protein measurement. Although we do not
know what the errors are, we suggest the following as a rea-
sonable speculation. For the highest-abundance proteins, we
may have underestimated the amount of protein because of a
slightly nonlinear response of the phosphorimager screens. For
the lowest-abundance proteins, Gygi et al. may have overesti-
mated the amount of protein because of difficulties in accu-
rately cutting very small spots out of the gel and because of 
difficulties in background subtraction for these small, weak 
spots. The difference in the middle abundance proteins may be 
a consequence of normalization, given the two errors above. 

The low-abundance proteins in the data set of Gygi et al. 
have a poor correlation with mRNA abundance. We calculate 
that the r_s is 0.74 for the top 54 proteins of Gygi et al. but only
0.22 for the bottom 53 proteins, a statistically significant dif-
ference. However, with our data set, the r_s is 0.62 for the top 33
proteins and 0.56 (not significantly different) for the bottom 33
proteins (which are comparable in abundance to the bottom
53 proteins of Gygi et al.). Thus, our data set maintains a good
correlation between mRNA and protein abundance even at
low protein abundance. This is consistent with our speculation
that protein quantification by phosphorimaging and image
analysis may be more accurate for small, weak spots than is
cutting out spots followed by scintillation counting. Our rela-
tively good correlations even for nonabundant proteins may
also reflect the fact that we used both SAGE data and RNA
hybridization data, which is most helpful for the least abundant
mRNAs. In summary, we feel that the poor correlation of
protein to mRNA for the nonabundant proteins of Gygi et al.
may reflect difficulty in accurately measuring these nonabun-
dant proteins and mRNAs, rather than indicating a truly poor
correlation in vivo. It is not surprising that observed correla- 
tions would be poorer with less-abundant proteins and 
mRNAs, simply because the accuracy of measurement would 
be worse. 

How well can mRNA abundance predict protein abun-
dance? With r_p = 0.76 for logarithmically transformed mRNA
and protein data, the coefficient of determination, (r_p)^2, is 0.58.
This means that more than half (in log space) of the variation
in protein abundance is explained by variation in mRNA abun-
dance. When converted back to arithmetic values, protein
abundances vary over about 200-fold (Table 1), and (r_p)^2 =
0.58 for the log data means that of this 200-fold variation,
about 20-fold is explained by variation in the abundance of
mRNA and about 10-fold is unexplained (but could be due
partly to measurement errors). For proteins much less abun- 
dant than those considered here, we imagine the in vivo cor- 
relation between mRNA and protein abundance will be worse, 
and other regulatory mechanisms such as protein turnover will 
be more important. 
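
The arithmetic behind the 20-fold and 10-fold figures can be sketched as follows. This is a hedged reading, assuming that the explained share of the log-scale variation is converted back to a fold range by raising the total fold range to the power (r_p)^2; the function name is hypothetical and the source does not spell out the calculation.

```python
def partition_fold_variation(r_p: float, total_fold: float):
    """Split a total fold range into explained and unexplained fold ranges.

    Assumes the coefficient of determination r_p**2 is the fraction of the
    log-scale variation that is explained, so the explained part of a
    total_fold range is total_fold ** (r_p**2) and the remainder is
    total_fold ** (1 - r_p**2). The two parts multiply back to total_fold.
    """
    r_squared = r_p ** 2
    explained = total_fold ** r_squared
    unexplained = total_fold ** (1.0 - r_squared)
    return r_squared, explained, unexplained


if __name__ == "__main__":
    r_squared, explained, unexplained = partition_fold_variation(0.76, 200.0)
    print(f"(r_p)^2 = {r_squared:.2f}")
    print(f"explained:   about {explained:.0f}-fold")
    print(f"unexplained: about {unexplained:.0f}-fold")
```

Under this assumption the explained part comes out at roughly 20-fold and the unexplained part at roughly 10-fold, in line with the rounded figures in the text.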

Some important conclusions can be drawn from this sam-
pling of the proteome. First, there is an enormous range of
protein abundance, from nearly 2,000,000 molecules per cell
for some glycolytic enzymes to about 100 per cell for some cell
cycle proteins (26a). Second, about half of all cellular protein
is found in fewer than 100 different gene products, which are
mostly involved in carbohydrate metabolism or protein synthe-
sis. Third, the correlation between protein abundance and CAI
is log linear as far as we can see, which is from about 10,000
protein molecules per cell to about 1,000,000. This is somewhat
surprising, because it implies that selective forces for codon
bias are significant even at moderate expression levels. It also
means that codon bias is a useful predictor of protein abun- 
dance even for moderately low bias proteins. Fourth, there is a 
good correlation between protein abundance and mRNA 
abundance for the proteins that we have studied. This validates 
the use of mRNA abundance as a rough predictor of protein 
abundance, at least for relatively abundant proteins. Fifth, for 
these abundant proteins, there are about 4,000 molecules of 
protein for each molecule of mRNA. This last conclusion 
raises questions as to how the levels of nonabundant proteins
are regulated and suggests that protein instability, regulated 
translation, suboptimal rates of translation, and other mecha- 
nisms in addition to transcriptional control may be very impor- 
tant for these proteins. 
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Abstract 

Tumor galanin content was measured in extracts from human pituitary adenomas using a specific RIA method for monitoring human
galanin. Twenty-two out of twenty-four tumors contained galanin with notably high levels in corticotroph adenomas, varying levels in
clinically inactive tumors, and low levels in GH secreting adenomas. Tumor galanin and ACTH contents were closely correlated in all
tumors. In four young patients with microadenomas and highly active Mb Cushing, tumor galanin was inversely related to tumor volume. The
molecular form of tumor galanin, studied with reverse-phase HPLC, was homogenous with the majority of tumor galanin coeluting with
standard human galanin. In the tumors analysed with in situ hybridization there was a good correlation between galanin peptide levels and
galanin mRNA expression. In some tumors galanin mRNA and POMC levels coexisted, in others they were essentially in different cell
populations. Levels of plasma galanin-LI were not related to tumor galanin concentration, and galanin levels were in the same range in sinus
petrosus close to the pituitary venous drainage as in peripheral blood. Corticotrophin releasing hormone injections in two patients caused
ACTH, but no detectable galanin release into sinus petrosus. Our results demonstrate that corticotroph, but not GH adenomas, express high
levels of galanin, in addition to ACTH, and that in some tumors both polypeptides are synthesised in the same cell population. However,
galanin levels in plasma were not influenced by the tumor galanin content.
© 2003 Elsevier B.V. All rights reserved.
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1. Introduction 

Galanin was first isolated from porcine intestine in 1983 
by Tatemoto et al. [1]. In most species galanin is a 29 amino 
acid, C-terminally amidated peptide, whereas the human 
form contains 30 amino acids with a non-amidated C- 
terminal [2-5]. A major breakthrough was the cloning of 
the rat galanin gene from the rat anterior pituitary and 
hypothalamus, and the demonstration of its estrogen sensi- 
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tivity [6,7]. Galanin has been shown to have multiple 
biological effects. Thus, it seems to be involved in, for 
example, regulation of feeding (see Refs. [8-10]), in 
learning (see Refs, [11,12]), and in responses to nerve injury 
and pain (see Ref. [13]). The first galanin receptor was 
cloned by Wabert-Ortholi et al. [14] and subsequently two 
further galanin receptor subtypes were identified [15,16]. 
The creation of a galanin knock-out mouse [17] and of 
galanin overexpressing mice [18-20] have provided new 
insights into the physiological roles of galanin. ' 

Galanin-like immunoreactivity (LI) is spread throughout
the central nervous system, including the hypothalamus,
with the most intense immunoreactivity in the median
eminence [21-24], suggesting participation in neuroendo-
crine processes controlling anterior pituitary hormone se-
cretion (see Refs. [9,25-27]). For example, galanin
increases GH secretion not only in animal models but also
in humans. Thus, given intravenously (i.v.) to man galanin
gives rise to a GH peak, delayed in time but equal to and
additive to that produced by GHRH [28,29]. On the other
hand, galanin has been shown to inhibit the hypothalamo-
pituitary-adrenal (HPA) axis [30-33]. Galanin has been
proposed to act as a mitogen in estrogen-induced prolacti-
nomas in the rat [34,35]. Galanin gene expression also
shows a direct correlation to somatotroph hyperplasia in
GHRH transgenic mice [36].

In the rat pituitary galanin is mainly present in the
lactotrophs [7], whereas in the human pituitary galanin is
coexpressed with ACTH. Thus, several groups [37-44]
have studied galanin-LI in pituitary adenomas establishing
its presence in ACTH-producing adenomas, but with vary-
ing results as to its presence in somatotroph, lactotroph and
clinically inactive adenomas.

The aim of this study was to investigate the presence, 
concentration and molecular form(s) of galanin in different 
types of pituitary adenomas using an extraction procedure 
and a newly developed radioimmunoassay (RIA), as well as
reverse-phase (RP)-HPLC. In addition, we wished to com-
pare tumor and serum levels of galanin to some other
pituitary hormones, with special reference to the ACTH-
cortisol axis. Finally, some tumors were analysed with in
situ hybridisation for expression of galanin and POMC. The 
results of these analyses were correlated to clinical data. 
Preliminary data from this investigation have previously 
been reported [45]. 

2. Material and methods

2.1. Patients

Samples for tumor extraction were obtained from 24 
patients with pituitary adenomas (Table 1), 15 women (26- 
78 years, mean 49.7 years) and 9 men (31-74 years, mean
53.3 years) who underwent transsphenoidal surgery at the
Department of Otorhinolaryngology, Karolinska Hospital.
Seven patients had hypersecretion of ACTH, six of GH, and
one of prolactin (PRL). Ten tumors were clinically inactive
(Table 1). The tumors were identified by clinical symptoms
and signs as well as magnetic resonance imaging scans. The
size of the seven ACTH-producing tumors was measured on
contrast-enhanced T1-weighted coronal and sagittal images
after correction for scaling. The standard formula for vol-
ume measurement of a rotational ellipsoid was used for
volume calculations [46]. The size of the remaining tumors
was assessed with stage and grade according to Hardy and 
Wilson (see Ref. [47]). Separate tumor samples were also 
analysed in the Department of Pathology of the Karolinska 
Hospital. 
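
The text does not reproduce the volume formula from Ref. [46]. As a hedged illustration only, the geometric volume of a rotational ellipsoid (spheroid) with diameter d_axis along the axis of rotation and diameter d_equatorial perpendicular to it is V = (π/6) · d_axis · d_equatorial². The sketch below assumes that form; the function and parameter names are hypothetical and the measurements actually entered by the authors are not stated.

```python
import math


def spheroid_volume(d_axis: float, d_equatorial: float) -> float:
    """Volume of a rotational ellipsoid from its two diameters.

    Equivalent to (4/3) * pi * a * b**2 with semi-axes a = d_axis / 2
    (along the rotation axis) and b = d_equatorial / 2. Units of the
    result are the cube of the input length unit.
    """
    if d_axis <= 0 or d_equatorial <= 0:
        raise ValueError("diameters must be positive")
    return math.pi / 6.0 * d_axis * d_equatorial ** 2


if __name__ == "__main__":
    # Illustrative diameters in millimetres, not patient data.
    print(f"{spheroid_volume(6.0, 4.0):.1f} mm^3")
```

When both diameters are equal the expression reduces to the volume of a sphere, which is a quick sanity check on the formula.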

The patients with Cushing's disease had clinical symp-
toms and signs of hypercortisolism. The diagnosis was
confirmed by elevated ACTH and cortisol levels, absent
diurnal rhythm, pathological CRH-test (n = 6) with sinus
petrosus sampling (n = 4) or low and high dose dexametha-
sone (n = 1). All had pituitary microadenomas visible on
MRI scans. They were all cured clinically and biochemi-
cally after surgery. Separate pathological anatomical exam-
ination confirmed a pituitary adenoma in six cases. In one
case (#6) there was insufficient material for analysis.

Galanin was monitored in two patients during sinus 
petrosus sampling. Samples were not available for galanin
analysis from a further two patients who underwent sinus 
petrosus sampling, confirming in both cases ACTH produc- 
ing pituitary adenoma. For control, plasma samples were 
collected from 22 healthy individuals (12 men, 28-55 years 
old; 10 women 25-62 years old). 

The study was approved by the Ethics Committee of
Karolinska Institutet (nos. 96-145 and 99-326).

2.2. Samples 

Tumor samples for extraction and in situ hybridisation
were collected at surgery, placed on dry ice and kept frozen
at -70 °C.

Blood samples for galanin measurement were collected 
in EDTA plasma tubes in the morning fasting state prior to 
surgery and during sinus petrosus sampling. Samples were 
placed on ice and centrifuged at 4 °C (2400 rpm) for 10
min. Plasma was withdrawn and stored at -70 °C until
analysis.

2.3. Extraction procedure 

For extraction, frozen tumor tissues were weighed, cut, 
still in frozen state, boiled for 10 min in acetic acid (1 mol/l),
cooled on ice, homogenized and centrifuged for 10 min
(2000 rpm) at 4 °C. After collection of the supernatant, the
pellet was resuspended in water, followed by a second
boiling, homogenization and centrifugation as above. The
supernatants from these two extraction procedures were
pooled, lyophilised and stored at 4 °C. Samples were
reconstituted in phosphate buffer (0.05 mol/l; pH 7.4) before
assay. 

2.4. Gel permeation chromatography

Gel permeation chromatography was performed on one 
tumor rich in galanin (#5; Table 1). A Sephadex G-50 
Superfine (Pharmacia, Uppsala, Sweden) column (2.6 x 97 
cm) was equilibrated and eluted at room temperature with 
formic acid (0.1 mol/l) with 0.02% sodium azide. Void
volume (V0) was determined with Dextran blue and total
volume (Vt) with 22Na. Human synthetic galanin (Peninsula,
Belmont, CA) in RIA buffer was used for column calibration.
Tumor extract (50 µl in 1 ml water) was eluted at a flow
rate of 15 ml/h, and the eluate was collected in fractions
(2 ml), which were lyophilised and stored at 4 °C until
Pat. id. #   Tumor type   Age   Sex   T-GAL    P-GAL
21           Inactive     62    m     0.21     -       3.48     16.57
22           Inactive     69    f     0.13 a   40      6.45     49.61
23           Inactive     72    f     1.85     82      20.16    10.9
24           Inactive     74    m     5.67     38      553.6    97.6

Inactive: clinically inactive adenoma.

Patients are arranged according to tumor type and age. Sex, tumor galanin content (T-GAL) and plasma galanin (P-GAL) are listed.
a Below detection limit in the RIA. For the purpose of calculation, they were assigned values of 20 pmol/l, i.e. the limit of detection.



assay. For RIA, each fraction was dissolved in 100 µl RIA
buffer.

2.5. Reverse-phase-HPLC 

Extracts from the nine adenomas containing the largest
amounts of galanin, i.e. seven ACTH-secreting adenomas
(#1-7; Table 1) and two clinically inactive adenomas
(#16, #19; Table 1) were analysed with RP-HPLC. A
218TP C18 column (5 µm, 4.6 x 250 mm) from Vydac
(Hesperia, CA) was used to separate the galanin-LI. The
samples were eluted (1 ml/min) with a linear gradient of
water/acetonitrile (HPLC-grade, Merck, Darmstadt, Germany)
with 0.1% trifluoroacetic acid (Merck) increasing
from 20% to 70% by 1%/min. Aliquots of 0.5 ml were
sampled. The fractions were lyophilized and redissolved in
100 µl RIA buffer before analysis. One sample with very
high galanin content (#1; Table 2) was diluted 1:10 before
chromatography.

Table 2

Seven patients with ACTH producing pituitary adenomas

Pat.     T-GAL,      T-ACTH,     P-ACTH,   S-Cortisol,   U-Cortisol,
id. #    pmol/mg     pmol/mg     pmol/l    nmol/l        nmol/24 h
         protein     protein
1        620.16      2737.4      60        1190          2360
2        587.31      1302.24     7.3       576           1985
3        nu9         657.27      30        476           3192
4        850.67      72666.67    17        639           3394
5        82 J 5      18860       36        794           794
6        247.38      4065.04     15        579           1056
7        43.51       551.05      5.2       558           846

The table shows tumor tissue content of galanin (T-GAL) and ACTH (T-ACTH) in pmol/mg protein, morning fasting blood levels of ACTH and cortisol and 24
h urinary cortisol excretion.
Reference ranges: P-ACTH, 2-11 pmol/l.
S-Cortisol, 170-600 nmol/l.
U-Cortisol, 70-500 nmol/24 h.

2.6. Radioimmunoassay

Antiserum to human galanin(1-30) was produced as
described before [48]. Briefly, human galanin(1-30) was
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synthesized by Fmoc-solid phase synthesis and purified to 
apparent homogeneity by RP-HPLC. Five mg galanin were 
coupled to 30 mg keyhole limpet hemocyanine with 0.25 mg 
carbodiimide. New Zealand white rabbits were immunized
with 0.25 mg peptide equivalent dissolved in 1 ml saline
emulsified in 1.5 ml complete Freund's adjuvant (Sigma, St.
Louis, MO). Booster injections at 4-6 week intervals were
done with 0.08-0.1 mg peptide equivalent mixed with 1.5
ml incomplete Freund's adjuvant. The antiserum used was
obtained 10 days after the third booster injection. 

Final antibody concentration in the assay was 1:750,000 
yielding a 30% binding. Synthetic human galanin (Peninsula)
was used as standard and 125I-labelled human galanin
(New England Nuclear, NEN, Boston, MA) as tracer. The
RIA buffer was a phosphate buffer (0.05 mol/l; pH 7.4)
containing 0.1% BSA, 0.02% NaN3, 0.01% Triton X, and
10 ml Trasylol/l buffer.

In the assay, 100 µl of standard or samples in duplicates
(or in single samples from chromatography) were preincubated
with 100 µl antibody for 24 h at 4 °C. After the
addition of 100 µl tracer (approx. 8000 cpm) incubation
continued for another 24 h. Free and bound radioligand
were separated by adding 500 µl sheep anti-rabbit antibodies
(Pharmacia decanting suspension; Pharmacia,
Uppsala, Sweden). After 30 min incubation and addition
of 500 µl water, the samples were centrifuged at 4 °C (3000
rpm) for 15 min and decanted, and the radioactivity in the
bound fraction was counted in a gamma counter.

Serial dilutions of tumor extracts showed good parallel- 
ism with the standard curve. Non-specific binding in the 
RIA was 2%. Detection limit was 20 pmol/l and IC50 312
pmol/l. Intraassay variation was 5% and interassay variation
14%. 

Crossreactivity was measured at 50% binding, or at the 
highest binding for those peptides not attaining a 50% 
binding. In this assay, there was 62% cross-reactivity with
rat galanin, 36% with galanin(1-16), and 0.8% with pancreastatin,
whereas less than 0.1% was found for galanin(20-30),
galanin message-associated peptide(1-41) or
-(44-49), CRH, GHRH, somatostatin, vasoactive intestinal
polypeptide, peptide histidine-isoleucine, calcitonin
gene-related peptide or oxytocin.
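
The cross-reactivity percentages are reported without the formula behind them. A minimal sketch, assuming the common convention that cross-reactivity is the ratio of the standard concentration to the competitor concentration that each give 50% binding: the function name and the competitor concentration in the usage line are hypothetical; only the IC50 of 312 pmol/l for the galanin standard comes from the text.

```python
def cross_reactivity_percent(standard_ic50, competitor_ic50):
    """Cross-reactivity (%) from the concentrations giving 50% binding.

    Both arguments must be in the same unit (e.g. pmol/l). A competitor that
    needs twice the concentration of the standard scores 50%.
    """
    if standard_ic50 <= 0 or competitor_ic50 <= 0:
        raise ValueError("concentrations must be positive")
    return 100.0 * standard_ic50 / competitor_ic50


if __name__ == "__main__":
    # 312 pmol/l is the IC50 reported for the standard; 624 pmol/l is a
    # made-up competitor value used only to illustrate the arithmetic.
    print(cross_reactivity_percent(312.0, 624.0))
```

For peptides that never reach 50% binding, the text states that the highest binding attained was used instead, which this sketch does not cover.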

ACTH in tumor tissue was analysed by an immunor- 
adiometric assay (IRMA) from Brahms Diagnostica (Berlin, 
Germany) using two monoclonal antibodies recognizing 
different binding sites on the antigen. The assay was carried 
out in two steps to prevent antigen excess resulting in falsely 
low values. The tumor samples were diluted stepwise with 
the zero-standard. 

Protein content was analysed with the Bio-Rad Protein
Assay (Bio-Rad, Munich, Germany).

2.7. Preparation of probes

Altogether three oligonucleotides (Scandinavian Gene 
Synthesis, Köping, Sweden) were used in this study for in
situ detection of POMC mRNA (n = 1) and human galanin
mRNA (n = 2). Oligonucleotides were designed as below 
from published gene sequences. 

(A) Oligonucleotide complementary to POMC mRNA
[49]:

(i) 5'GCCCTCCCGTGGACTTGGCTCCGGACGGCCATCTCCCCCCGCCGTCTCTTCCTC3'

(B) Oligonucleotides complementary to human galanin
mRNA [3]:

(i) 5'CTGTGGTTGCCAACGGCATGTGGGCCCAGCAGGTAGCCCGCGCTGTTC3'

(ii) 5'GTCAAAGCTTCCTGGTTTCATGTCATCTTCGGGCCGCAGCACCCGCTT3'

All oligonucleotides were chosen in regions presenting 
few homologies with sequences of related mRNAs, and they 
were checked against the GenBank database.
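
Because the probes are complementary to the mRNA, each one should be the reverse complement of the sense-strand stretch it targets (with T written for U). The following is a minimal sketch of that check; the helper names and the short test sequence are hypothetical, and nothing here reproduces the authors' actual design procedure.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, ignoring whitespace."""
    cleaned = "".join(seq.split())
    invalid = set(cleaned.upper()) - set("ACGT")
    if invalid:
        raise ValueError("unexpected characters: " + "".join(sorted(invalid)))
    return cleaned.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Fraction of G and C bases, a rough indicator of hybridization stability."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


if __name__ == "__main__":
    target_sense = "ATGGCC"  # hypothetical sense-strand fragment
    print(reverse_complement(target_sense))
```

Taking the reverse complement of a probe gives back the sense sequence it is meant to hybridize with, which can then be compared with the published gene sequence.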

The oligonucleotides complementary to human galanin 
mRNA were labelled as previously described [50] at the 
3'-end using terminal deoxynucleotidyl transferase (TdT)
(Amersham, Amersham, UK) in a cobalt-containing buffer
with 35S-dATP (New England Nuclear, Boston, MA,
USA) to a specific activity of 1-4 x 10^9 cpm/µg and
purified in Nensorb-20 columns (New England Nuclear).
The oligonucleotide complementary to POMC mRNA was
labelled by tailing at the 3'-end with digoxigenin-11-dUTP
(Boehringer Mannheim, Mannheim, Germany)
according to published protocols [51]. Briefly, 100 pmol
of each probe were incubated in a final volume of 20
µl with 1 nmol of digoxigenin-11-dUTP, 9 nmol of dATP
(Sigma), 50 units of TdT and 4.2 µl of cobalt-containing
buffer. After 45 min at 37 °C the reaction was stopped
before purification by ethanol precipitation, and probes 
were then stored at -20 °C. 

2.8. In situ hybridization procedures 

The frozen pituitary tumors were cut at 14 µm thickness
in a cryostat (Microm, Heidelberg, Germany) and thaw-mounted
onto ProbeOn slides (Fisher Scientific, Pittsburgh,
PA). The sections were then processed as described earlier
[50]. In brief, tissue sections were air-dried and incubated
for 16 h at 42 °C with 0.5 ng of each of the two
radioactively labelled human galanin probes and 10 nmol/l
of the digoxigenin-labelled POMC probe. The probes were
diluted in a hybridization solution containing 50% deionized
formamide (Baker, Deventer, The Netherlands), 4 x standard
saline citrate (SSC, 1 x SSC = 0.15 M NaCl and 0.015
M NaCitrate), 1 x Denhardt's solution (0.02% bovine
serum albumin, 0.02% Ficoll (Pharmacia), 0.02% polyvinylpyrrolidone),
0.02 M NaPO4 (pH 7.0), 1% N-lauroylsarcosine,
10% dextran sulphate (Pharmacia), 500 mg/l
denatured salmon testis DNA (Sigma) and 200 mM
dithiothreitol (Sigma). After hybridization, the sections were
rinsed in 1 x SSC, 4 x 15 min at 55 °C followed by 30 min
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at room temperature. The sections were then immersed for 
30 min in buffer A (0.1 M Tris, pH 7.5, 1 M NaCl, 2 mM
MgCl2) containing 0.5% bovine serum albumin (Sigma) and
incubated overnight at 4 °C in the same solution with
alkaline phosphatase-conjugated anti-digoxigenin F(ab)
fragment (1:5000; Boehringer Mannheim). Afterwards, they
were rinsed 3 x 10 min in buffer A and twice in buffer B
(0.1 M Tris, pH 9.5, 0.1 M NaCl, 5 mM MgCl2). Alkaline
phosphatase activity was developed by incubating the
sections with 44 µl NBT and 33 µl BCIP (Gibco, Gaithersburg,
MD) diluted in 10 ml buffer B. The enzymatic
reaction was stopped by extensive rinsings in distilled water
and in buffer B.

The sections were air dried and coated twice with 3% 
collodion dissolved in amyl acetate (LKB, Stockholm,
Sweden) [52]. They were then dipped into Ilford K5 nuclear
emulsion (Ilford, Mobberley, Cheshire, UK) diluted 1:1 with
distilled water, exposed for 3 weeks, developed in Kodak
D19 for 3 min and fixed in Kodak 3000 for 8 min. After
mounting in glycerol, the sections were analysed in a Nikon
Microphot-FX microscope equipped with a dark field
condenser. 

2.9. Statistical analysis 

Results are presented as mean ± SEM, unless otherwise
stated. Comparisons between groups were assessed by
Mann-Whitney Rank Sum test or by one-way ANOVA,
followed by Duncan's Multiple Comparison procedure.
When applicable, non-normally distributed variables were
log transformed before analysis, in order to get a more
closely approximated Gaussian distribution. Correlations
were performed using least square linear regression analysis.
For the purpose of calculation, undetectable levels of
galanin (#8, 22) were assigned values of 20 pmol/l. Statistical
significance was set at p < 0.05. Statistical analyses
Fig. 1. Galanin concentration in 24 pituitary adenomas arranged according
to tumor type. Box plots indicate the median and the lower and upper
quartiles, the whiskers the 10th and 90th percentiles, and the outliers are
shown as circles.
Fig. 2. Tumor galanin content related to tumor volume in seven patients
with ACTH secreting adenomas. Patient number and age are given in the
figure. Dots indicate young patients, triangles old ones.

were performed using SigmaStat for Windows (Jandel
Scientific, Erkrath, Germany).
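
The data handling described in this section (substituting the detection limit for undetectable values, log transformation, mean ± SEM and a least-squares correlation coefficient) can be sketched as follows. This is an illustration under stated assumptions, not the SigmaStat procedure the authors ran: the function names are hypothetical, `None` is used here to mark an undetectable value, base-10 logarithms are assumed, and the numbers in the usage lines are toy data rather than study results.

```python
import math

DETECTION_LIMIT = 20.0  # pmol/l, the RIA detection limit given in the text


def impute_below_limit(values, limit=DETECTION_LIMIT):
    """Replace undetectable values (marked as None) with the detection limit."""
    return [limit if v is None else v for v in values]


def log_transform(values):
    """Base-10 log transform for skewed, strictly positive data."""
    return [math.log10(v) for v in values]


def mean_sem(values):
    """Return (mean, standard error of the mean) using the n-1 variance."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values")
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, math.sqrt(variance / n)


def pearson_r(x, y):
    """Correlation coefficient of a least-squares linear regression of y on x."""
    if len(x) != len(y) or len(x) < 3:
        raise ValueError("need two equally long series with at least 3 points")
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a series with zero variance has no correlation")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    toy = impute_below_limit([None, 35.0, 80.0])  # toy plasma values
    print(mean_sem(toy))
    print(pearson_r([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]))
```

Substituting the detection limit gives an upper bound for the undetectable samples, so group means that include them are slightly overestimated.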



3. Results 

3.1. Tumor samples

Galanin-LI was present in extracts from 22 of the 24 
pituitary adenomas examined in this study (Table 1; Fig. 1).
High levels of galanin-LI ranging between 43.5 and 851
pmol/mg protein (mean 363 pmol/mg) were found in the
seven adenomas from patients with Cushing's disease (#1-7,
Table 2). The six GH-producing adenomas contained low
Fig. 3. Tumor galanin content related to tumor ACTH content in 24
pituitary adenomas specified to tumor type. There is a good correlation
between galanin and ACTH tumor concentration (r = 0.932; p < 0.001).
Fig. 4. Tumor galanin content related to 24 h urinary cortisol excretion in
seven patients with ACTH secreting pituitary adenomas. Exclusion of
patient #3 results in r = 0.974; p < 0.001. If included, r = 0.614; p < 0.142.



amounts of galanin-LI, from 0.04 to 1.87 pmol/mg protein
(mean 0.41 pmol/mg) (#8-13, Table 1). Also, the single
prolactinoma had very low galanin-LI (#14, Table 1).

The 10 clinically inactive adenomas contained varying
amounts of galanin-LI (#15-24, Table 1). Five of these
adenomas had low galanin content, ranging from 0.13 to
1.85 pmol/mg (#17, 20-23, Table 1), three had levels in the
intermediate range (5.67-17.1 pmol/mg; #15, 18, 24),
whereas two adenomas were rich in galanin-LI (99.9 and
199 pmol/mg; #16, 19, respectively). There was no correlation
between galanin levels and patient age or sex,
irrespective of tumor type. Tumor volume did not correlate
to tumor galanin or ACTH content in the seven patients
with Mb Cushing (Fig. 2) (r = 0.638; p = 0.123). However,
the four youngest patients, who also exhibited the highest
cortisol per 24 h (#1-4, Table 2) had an inverse relation
between tumor galanin and volume (Fig. 2).

ACTH was detectable in all adenomas studied, and there 
was a close correlation between galanin-LI and ACTH levels
in the adenomas as a group (r = 0.932, p < 0.001; Fig. 3), as
well as in the clinically inactive adenomas (r = 0.919,
p < 0.001), but a less strong correlation among the GH
adenomas (r = 0.8429, p < 0.05). The seven corticotroph
adenomas did not show any correlation between galanin
and ACTH content (r = 0.434). In this group of patients 24-h
urinary cortisol excretion correlated strongly to tumor
galanin-LI in six out of seven patients (Fig. 4). In the seventh
patient (#3) comparatively low tumor galanin-LI and ACTH
values were associated with high urinary cortisol. This
patient had the largest tumor in the Mb Cushing group. No
correlation was found between urinary cortisol excretion and
tumor ACTH levels, nor did plasma ACTH or serum cortisol
relate to tumor galanin-LI or ACTH content (Table 2). After
surgery, all these patients had subnormal cortisol levels. One
patient (#2) had recurrent disease a few years later.
Fig. 5. Sinus petrosus sampling during CRH stimulation test in two patients with ACTH-producing tumors (#4 and 7; Table 1). After injection of 100 µg human
CRH (Corticobiss, Bissendorf, The Netherlands), samples were drawn from right and left cavernous sinus and peripheral vein at -1, 0, 2, and 5 min and
analyzed for ACTH and galanin.
3.2. Plasma samples



Galanin-LI in peripheral blood was measured in all but
five patients (Table 1). The levels ranged from undetectable
to 130 pmol/l (mean 54.2 ± 30 pmol/l). There was, however,
no correlation to tumor type, nor to tumor size, galanin-LI
content, patient age or sex (not shown).

Galanin-LI in plasma from healthy controls was 93.7 ±
44 pmol/l (101.9 ± 48.7 pmol/l in the men and 83.9 ± 37.6
pmol/l in the women), i.e. it was higher in controls than in the
patient group. These control samples were collected 5 years
after the patient samples and analysed with a slightly
modified RIA including Trasylol to further prevent peptide
degradation. Galanin-LI and ACTH were measured in sinus
petrosus blood samples from two patients with Cushing's
disease (#4, 7) after CRH stimulation (Fig. 5). A rise in
plasma ACTH was noted in the left sinus of both patients,
with no change in peripheral ACTH levels. No effect was
observed on levels of galanin-LI in sinus petrosus or in the
periphery, nor did basal plasma concentrations of galanin-LI
in sinus petrosus differ from those in the periphery. Thus,
there was no correspondence to the high levels of galanin-LI
detected in tumor extracts from these two patients.
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3.3. Chromatography 

Gel permeation chromatography of one tumor (#5, Table 
1) revealed one major peak of galanin-LI with the same
retention as synthetic human galanin. RP-HPLC of nine
adenoma extracts (Fig. 6) showed that in eight of these
tumors the majority of galanin-LI had the same retention as
synthetic human galanin (16 min) (Fig. 6A). The main peak
with galanin-like material was preceded by a smaller one
with an elution time of 9-11 min in six extracts and of 13
min in two tissue extracts. One adenoma (#6, Table 1)
differed from the rest, in that there was only one major
chromatographic peak with galanin-like material with a
longer retention time (20 min) coinciding with synthetic
galanin(1-16) (Fig. 6B).
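
The retention times can be related to the gradient given in Section 2.5 (20% to 70% at 1%/min, flow 1 ml/min, 0.5 ml fractions). The sketch below is a rough illustration only: it assumes that the percentage refers to acetonitrile, that the gradient starts at injection, and that there is no dwell volume or column delay; the function names are hypothetical.

```python
import math


def percent_organic(t_min, start_pct=20.0, end_pct=70.0, slope_pct_per_min=1.0):
    """Nominal organic-phase percentage at time t_min of a linear gradient."""
    if t_min < 0:
        raise ValueError("time must be non-negative")
    return min(end_pct, start_pct + slope_pct_per_min * t_min)


def fraction_number(t_min, flow_ml_per_min=1.0, fraction_ml=0.5):
    """1-based index of the collected fraction that contains time t_min."""
    if t_min < 0:
        raise ValueError("time must be non-negative")
    return int(math.floor(t_min * flow_ml_per_min / fraction_ml)) + 1


if __name__ == "__main__":
    for label, t in (("early peak", 9.0), ("main peak", 16.0), ("late peak", 20.0)):
        print(label, percent_organic(t), fraction_number(t))
```

Under these assumptions the main peak at 16 min corresponds to a nominal 36% organic phase and the later peak at 20 min to 40%.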

3.4. In situ hybridization 

The in situ hybridization analysis showed that the pieces 
of the various tumors processed with this technique could be 
divided into at least three groups with regard to galanin 
expression, (i) A galanin signal was found in a large number 
of cells, sometimes apparently all cells of the tissue piece;
Fig. 6. (A, B) Reverse-phase high performance liquid chromatography of galanin immunoreactivity in pituitary extracts. (A) Results are mean ± SEM of eight
pituitary adenomas. (B) Results of one pituitary adenoma. (C) Elution of human galanin(1-30). (D) Elution of galanin(1-16). Please note different scale in (C).
Fig. 7. (A-C) Darkfield micrographs of ACTH-producing adenomas (A, C, #1; B, #2) after hybridization with probes complementary to prepro-galanin
mRNA (A, B) and after addition of excess of cold probe (C). A very strong signal is seen evenly distributed over both tumors (A, B) but only few grains in the
control (C). Scale bar indicates 50 µm for all micrographs.




Fig. 8. (A-E) Darkfield (A-C), brightfield (E) and combined dark- and brightfield (D) micrographs of ACTH-producing adenoma (#4) after
double-hybridization with radioactive probes complementary to prepro-galanin mRNA (A, C, D) and non-radioactive probe complementary to POMC mRNA (D, E)
or after hybridization with an excess of galanin cold probe (B). A strong galanin signal (bright grains) is seen in strands within the tumor tissue (A, C), and they
are complementary to the POMC mRNA (dark labelling) distribution (C-E). No signal is seen in the control section (B). Scale bars indicate 50 µm (A = B;
C = D = E).
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(ii) a limited number of cells expressed galanin mRNA; and 

(iii) no galanin mRNA expression could be detected. Two 
tumors showed an extremely strong signal that could not be 
blocked by an excess of cold probe, that is the signal was 
considered to be unspecific.

To group (i), i.e. strongly galanin mRNA expressing
tumors, belonged patients #1 (Fig. 7A), #2 (Fig. 7B) and
#4 (Fig. 8A, C), whereby one of the two pieces analysed of
tumor #1 showed a less compact distribution (data not
shown) and #4 showed a markedly uneven distribution with
distinct bands of positive cell groups separated by areas with
a low signal (Fig. 8A, C). Tumor #5 showed a signal all over
the sections but of a comparatively low intensity (data not
shown). To the second group (ii) with a limited number of
cells expressing galanin mRNA belonged tumors #3 (Fig.
9A) and #19 (Fig. 9D); in the latter galanin-positive cells
often formed small groups (Fig. 9E). In fact, in general the
tissue piece of this tumor processed for in situ hybridization
appeared to have a normal pituitary morphology at the
cellular level (Fig. 9E). In group (iii) no detectable signal






Fig. 9. (A-F) Darkfield (A, D-F), brightfield (C) and combined dark- and brightfield (B) micrographs of ACTH-producing adenoma (#3) (A-C) and of a
clinically inactive tumor (#19) (D-F) after double-hybridization with radioactive probe complementary to prepro-galanin mRNA and non-radioactive probe
complementary to POMC mRNA (A-C) or after hybridization with probe complementary to prepro-galanin mRNA without (D) or with (E) bisbenzimide
contrast staining, or after hybridization with an excess of cold probe added to the hybridization cocktail (F). (A-C) Double in situ hybridization reveals several
cells expressing both prepro-galanin mRNA (bright grains) and POMC mRNA (dark labelling) (small arrowheads) as well as POMC mRNA-positive and
prepro-galanin mRNA-negative cells (big arrowheads). (D-F) Note strong signals in restricted areas of the tissue (D) which in E can often be seen to consist of
small groups of cells (small arrowheads). Big arrowheads point to negative cell groups. Note low signal in the control experiment (F). Scale bars indicate 50 µm
(A = B = C; D = E = F).
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was observed in tumors #10, #16, #17, or #21 (data not 
shown). 

Some tumors were processed for double in situ hybrid- 
ization with a non-radioactive POMC mRNA probe. A high 
degree of coexistence was observed in tumors #1 and #3, 
whereby this was observed in the majority of cells in tumor
#3, but POMC mRNA-positive, galanin mRNA-negative
cells were also observed (Fig. 9A-C). In tumor #1 all
galanin mRNA positive cells had a POMC mRNA signal.
Tumor #4 showed a clear separation between the non-radioactive
POMC mRNA signal and the radioactive galanin
signal (cf. Fig. 8E with C and D), suggesting synthesis in
separate cells. However, POMC mRNA-galanin mRNA
coexistence situations could not be excluded. Finally, in
some tumors (e.g. #10 and #21) a very strong signal was
observed after incubation with the galanin mRNA probes. In
all other tumors described above (Figs. 7C, 8B and 9F), the
signal could be blocked by addition of an excess (100 x) of
cold probe, except in tumors #10 and #21. They had a very
strong signal which was observed to the same extent after 
addition of cold probe (data not shown). In agreement, both 
tumors (#10, 21) had very low galanin levels as monitored 
with RIA. 



4. Discussion 

In this study, using a RIA for human galanin, we demonstrate
high levels of this peptide in extracts from ACTH-producing
pituitary adenomas, moderate and varying galanin
levels in clinically non-functioning adenomas and low or not
detectable levels in GH- and PRL-producing adenomas, with
a good correlation to ACTH levels in the adenomas. Moreover,
the RIA data were supported by the in situ hybridization
results; a significant galanin signal was only found in
the corticotroph adenomas. Double in situ hybridization
revealed coexistence of galanin mRNA and POMC mRNA
in most tumors, although a clear cellular dissociation of the
two transcripts was seen in some other tumors.

When discussing these results, one should keep in mind 
that different parts of the tumor tissue sample were analyzed 
in each of the various assays. Thus, differences in distribu- 
tion of various markers within the tumor should be consid- 
ered. Such differences were, in fact, encountered in the in 
situ hybridization analysis. 

4.1. Galanin in corticotroph adenomas

A number of studies, using mainly immunostaining and 
in situ hybridization, have demonstrated presence of galanin 
in pituitary adenomas, especially in ACTH-producing 
tumors (see Table 3 for overview). Here we show that
galanin levels in pituitary adenoma extracts vary with tumor
type and correlate strongly with ACTH levels, supporting
previous studies showing co-production of the two peptides.
Indeed, the present in situ hybridization results using 



Table 3

Galanin in human pituitary adenomas and in normal pituitaries

                        Pituitary adenomas                      Normal
Reference               ACTH     GH       PRL      Inactive     pituitaries
Hulting et al. [37]     -        1/1      -        -            -
Vrontakis et al. [38]   13/18    0/4      0/8      0/5          23/23
Hsu et al. [40]         16/19    5/11     2/14     9/18         7/7
Polak et al. [41]       6/6      3/8      2/2      0/7          -
Sano et al. [42]        9/14     0/10     0/9      0/9          -
Bennet et al. [39]      -        2/4      0/1      3/13         30/30
Invitti et al. [43]     5/5      -        -        -            -
Leung et al. [44]       10/16    1/26     1/19     25/89        6/6

Review of data published 1989-2002.



double-labeling on single sections show conclusively that
individual cells can synthesize both ACTH and galanin.
However, whereas coexistence of galanin and POMC
mRNA was found in apparently all cells of one corticotroph
adenoma (#1, 2; Table 1; Fig. 7A, B), another tumor of this
type showed a clear separation of the two signals (#4, Table
1; Fig. 8A, C-E). Moreover, co-expression of galanin and
POMC mRNA in the ACTH adenomas was reflected in a
close correspondence between tumor content of galanin and
ACTH, whereas when the peptides were expressed in
separate cells RIAble tumor content of ACTH vastly
exceeded that of galanin. Thus, absence of a correlation
between these two peptides in extracts from corticotroph
adenomas may reflect an expression of ACTH and galanin
in different cell populations.

A clinical significance of this differential expression
could not be related to age, sex, tumor size or pituitary
function. However, as shown by Invitti et al. [43], the in vivo
release of galanin in response to CRH varies among patients
with Cushing's disease. We found no elevation of plasma
galanin in sinus petrosus from two patients with Cushing's
disease. Thus, although both patients showed a marked
unilateral response to CRH, there was no similar effect on
galanin levels. One of these patients (#4) was analyzed with
double-labeling in situ hybridization, and in this patient
galanin and ACTH were synthesized in distinctly separate
cell populations (Fig. 8). Perhaps the variability in the
response to CRH can be explained by the extent of colocalization
of ACTH and galanin. Thus, when the peptides are
produced in different cells, CRH may only cause release of
ACTH. It would therefore be interesting to know whether or
not type-1 CRH receptors [53] are present in ACTH-nega- 
tive, galanin-positive cells. It should here be mentioned that 
in the normal state, stimulation of ACTH secretion with 
CRH, desmopressin or physical exercise does not affect 
plasma galanin levels [43,54,55]. 

The clinical significance of the presence of galanin in 
ACTH adenomas was also investigated by Leung et al. [44], 
showing that galanin-positive adenomas were smaller and 
had better prognosis than those not expressing galanin. The 
present study supports this finding; we have found that all
seven corticotroph adenomas were galanin-positive, and 
they were small and were successfully removed. Interestingly,
in the four young patients with microadenomas and
highly active Mb Cushing (#1-4) tumor galanin was
inversely related to tumor volume. In contrast to Leung et
al. [44] we found a correlation in six of seven corticotroph
adenomas between galanin content and 24 h urinary cortisol
excretion, representing integrated ACTH-cortisol production.
To what extent galanin can serve as a marker for tumor
activity as well as act as a growth inhibitory factor in this
type of tumor requires analysis of a larger material.



preoperative ACTH failure {#10, 20, 21, 23) had low levels 
(mean 0.61 pmol/mg protein). 

The patient (#16) with the highest tumor galanin content 
among the clinically inactive adenomas had shown clinical 
signs of Mb Cushing. However, laboratory investigation, 
performed after a tumor apoplexy, revealed normal ACTH 
and Cortisol secretion. This patient had a recurrence of 
Cu siting's diseas a lew years after surgery. Our sample 
may have contained ACTH adenomatous tissue or normal 
pituitary as indicated by itnmunocytochemistry. 



4.2. Galanin in clinically inactive GH and PR1, adenomas 4.3. Identity of galanin in pituitary tumors 



m 



The capacity of non-ACTH tumors to synthesize galanin 
has been controversial. Vrontakis ct al. [38] and Sano et al. 
[42 j found galanin-LI exclusively coexpressed with ACTH 
and not present in any other type of pituitary adenoma. We 
have previously demonstrated the presence of galanin in a 
GH adenoma [37], and Polak et al. [41] in GH and PRL 
adenomas. Hsu et al. [40], in. addition, found galanin-LI in 
half of the non-functioning : adenomas, including a few 
which were negative for ACTH. In the largest and most 
recent study, Leung et al. [44] demonstrated galanin-LI in 
one third of the clinically inactive adenomas but rarely in 
GH-or PRL-producing adenomas. 

In the only previous study based on extraction, Bennet et 
al. [39], using an antiserum against porcine galanin and a 
porcine tracer, -found detectable galanin in two of four GH 
adenomas and in three of thirteen clinically active adeno- 
mas. With the present extraction method we find gaianin-Li 
in all but two of the non-ACTH adenomas studied. How- 
ever, galanin levels above 5 pmol/mg protein were only seen 
in 5 of the 10 clinically inactive adenomas and in none of 
the GH- or PRL-producing adenomas. 

The presence of galanin was shown in all normal 
pituitaries examined in previous studies [38-40,44]. In fact, 
extraction yielded markedly higher galanin concentrations 
in normal pituitaries than in non-ACTH pituitary adenomas 
[39]. Since the present study is based on tissue obtained at 
pituitary surgery, it does not include a control group of 
normal pituitaries. However, in a previous study of extracts 
from 2 SO postmortem pituitaries Schmidt et al. [5] reported 
410 pmol galanin per gram frozen tissue. In our material the 
mean galanin content was 13,950 pmol/g frozen tissue in 
ACTH adenomas, 3.6 pmol in GH adenomas and 1.4-4170 
pmol in clinically inactive adenomas. 

The highly variable galanin content found in the clinically 
inactive adenomas correlated wel| with ACTH content in the 
same adenomas, and presence of normal corticotrophs in the 
samples containing high galanin and ACTH cannot be 
excluded. In one case (#19) this was verified by PAD 
showing normal pituitary tissue in 9 of 10 samples (cf. Fig. 
9D-F). This patient also incurred a postoperative panhypo- 
pituitarism. Indeed, the three patients who became ACTH 
deficient after surgery (#18, 19, 24) had elevated tumor
galanin-LI (mean 40.9 pmol/mg protein), whereas those with



The molecular form of galanin was studied with RP-
HPLC in the seven corticotroph adenomas (#1-7) and the two
clinically inactive adenomas with high galanin content
(#16, 19). The elution pattern was shown to be remark-
ably homogenous with a small peak preceding the main
form eluting with synthetic human galanin (Fig. 5A). It
agrees with the pattern found in non-ACTH pituitary ade-
nomas by Bennet et al. [39] and corresponds to the structure
determined by Schmidt et al. [5] who isolated and se-
quenced galanin from purified extracts of postmortem
pituitaries. Our finding indicates that the molecular form
of galanin in corticotroph adenomas is identical to that
found in clinically inactive tumors and in the normal
pituitary. The small polar peak eluting earlier than gala-
nin(1-30) probably represents degradation products which
have been shown to elute prior to the main form [56]. In one
case (#6) we found that galanin eluted later than human
standard with a peak coinciding with galanin(1-16) (Fig.
5B). This fragment has not been found in vivo in the rat
[56,57] or in man [2,5,28,58], and our finding suggests a
different processing of tumor galanin in this patient.

4.4. Concluding remarks 

We found that galanin is present in human corticotroph
adenomas and provide conclusive evidence at the transcript
level for cellular colocalization with POMC in some, but
not all tumors. Our results also indicate that galanin
present in other types of pituitary adenomas is associated
with normal pituitary tissue. The molecular form of tu-
morous pituitary galanin is homogenous representing gen-
uine human galanin(1-30), but in one adenoma a form of
galanin coeluting with galanin(1-16) was identified. No
evidence of galanin secretion from these adenomas into
local or peripheral circulation was obtained, pointing to a
mainly auto- and/or paracrine role for galanin produced in
corticotroph adenomas.
CHAPTER 29



Regulation of transcription 



The phenotypic differences that distinguish the
various kinds of cells in a higher eukaryote are
largely due to differences in the expression of
genes that code for proteins, that is, those tran-
scribed by RNA polymerase II. In principle, the
expression of these genes might be regulated at
any one of several stages. The concept of the
"level of control" implies that gene expression
is not necessarily an automatic process once it
has begun. It could be regulated in a gene-
specific way at any one of several sequential
steps. We can distinguish (at least) five poten-
tial control points, forming the series:

Activation of gene structure
↓
Initiation of transcription
↓
Processing the transcript
↓
Transport to cytoplasm
↓
Translation of mRNA

The existence of the first step is implied by
the discovery that genes may exist in either of
two structural conditions. Relative to the state
of most of the genome, genes are found in
an "active" state in the cells in which they
are expressed (see Chapter 27). The change of
structure is distinct from the act of transcrip-
tion, and indicates that the gene is "transcrib-
able." This suggests that acquisition of the
"active" structure must be the first step in gene
expression.

Transcription of a gene in the active state is
controlled at the stage of initiation, that is, by
the interaction of RNA polymerase with its pro-
moter. This is now becoming susceptible to
analysis in the in vitro systems (see Chapter
28). For most genes, this is a major control
point; probably it is the most common level of
regulation.

There is at present no evidence for control 
at subsequent stages of transcription in eukary-
otic cells, for example, via antitermination
mechanisms.

The primary transcript is modified by capping 
at the 5' end, and usually also by polyadenyla- 
tion at the 3' end. Introns must be spliced out 
from the transcripts of interrupted genes. The 
mature RNA must be exported from the nucleus
to the cytoplasm. Regulation of gene expression
by selection of sequences at the level of nuclear
RNA might involve any or all of these stages,
but the one for which we have most evidence
concerns changes in splicing: some genes are
expressed by means of alternative splicing pat-
terns whose regulation controls the type of pro-
tein product (see Chapter 30).

Finally, the translation of an mRNA in the cyto-
plasm can be specifically controlled. There is little
evidence for the employment of this mechanism in
adult somatic cells, but it does occur in some
embryonic situations, as described in Chapter 7.
The mechanism is presumed to involve the block-
ing of initiation of translation of some mRNAs by
specific protein factors.

But having acknowledged that control of gene
expression can occur at multiple stages, and
that production of RNA cannot inevitably be
equated with production of protein, it is clear
that the overwhelming majority of regulatory
events occur at the initiation of transcription.
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg-
ulatory transcription factor serves to provide
common control of a large number of target
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing
groups of genes under common control is that
they share a promoter element that is recognized
by a regulatory transcription factor. An element
that causes a gene to respond to such a factor
is called a response element; examples are the
HSE (heat shock response element), GRE
(glucocorticoid response element), SRE (serum
response element).

The properties of some inducible transcription
factors and the elements that they recognize are
summarized in Table 29.1. Response elements
have the same general characteristics as
upstream elements of promoters or enhancers.
They contain short consensus sequences, and
copies of the response elements found in dif-
ferent genes are closely related, but not neces-
sarily identical. The region bound by the factor
extends for a short distance on either side of
the consensus sequence. In promoters, the ele-
ments are not present at fixed distances from
the startpoint, but are usually <200 bp upstream
of it. The presence of a single element usually
is sufficient to confer the regulatory response,
but sometimes there are multiple copies.

Table 29.1 Inducible transcription factors bind to
response elements that identify groups of promoters
or enhancers subject to coordinate control.

Regulatory Agent   Module   Consensus         Factor

Heat shock         HSE      CNNGAANNTCCNNG    HSTF
Glucocorticoid     GRE      TGGTACAAATGTTCT   Receptor
Phorbol ester      TRE      TGACTCA           AP1
Serum              SRE      CCATATTAGG        SRF
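
As a rough illustration of how a short consensus sequence can pick out candidate response elements, the following Python sketch scans the 200 bp upstream of a startpoint for exact matches to the consensus sequences listed in Table 29.1, treating N as any base. The function names and the example sequence are hypothetical and are not taken from the chapter. Exact matching is also a simplification, since copies of an element in different genes are closely related to the consensus but not necessarily identical.

```python
import re

# Consensus sequences as listed in Table 29.1 (N = any nucleotide).
CONSENSUS = {
    "HSE": "CNNGAANNTCCNNG",
    "GRE": "TGGTACAAATGTTCT",
    "TRE": "TGACTCA",
    "SRE": "CCATATTAGG",
}


def consensus_to_regex(consensus):
    """Translate a consensus string into a regex; N matches any base."""
    pattern = "".join("[ACGT]" if base == "N" else base for base in consensus)
    return re.compile(pattern)


def find_elements(upstream, window=200):
    """List (element, distance) pairs for exact consensus matches.

    `upstream` is assumed to be the DNA immediately 5' of the startpoint,
    written 5'->3' and ending at position -1. Only the last `window` bases
    are searched. `distance` is the number of bp between the first base of
    the match and the startpoint.
    """
    region = upstream.upper()[-window:]
    hits = []
    for name, consensus in CONSENSUS.items():
        for match in consensus_to_regex(consensus).finditer(region):
            hits.append((name, len(region) - match.start()))
    # Report the most distal element first.
    return sorted(hits, key=lambda hit: -hit[1])


# Hypothetical upstream region carrying one HSE-like and one TRE-like site.
example_upstream = "GGG" + "CTAGAAGCTCCAGG" + "A" * 20 + "TGACTCA" + "TT"
example_hits = find_elements(example_upstream)
```

A more realistic search would tolerate mismatches or use a position weight matrix; the exact-match version is kept here only because it maps directly onto the table.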

Response elements may be located in pro-
moters or in enhancers. Some types of elements
are typically found in one rather than the other;
usually an HSE is found in a promoter, while a
GRE is found in an enhancer. We assume that
all response elements function by the same
general principle. A gene is regulated by a
sequence at the promoter or enhancer that is
recognized by a specific protein. The protein
functions as a transcription factor needed for
RNA polymerase to initiate. Active protein is
available only under conditions when the gene is
to be expressed; its absence means that the pro-
moter is not activated by this particular circuit.

An example of a situation in which many
genes are controlled by a single factor is pro-
vided by the heat shock response. This is com-
mon to a wide range of prokaryotes and
eukaryotes and involves multiple controls of
gene expression: an increase in temperature
turns off transcription of some genes, turns on
transcription of the heat shock genes, and
causes changes in the translation of mRNAs.
The control of the heat shock genes illustrates
the differences between prokaryotic and
eukaryotic modes of control. In bacteria, a new
sigma factor is synthesized that directs RNA
polymerase holoenzyme to recognize
Abstract

Translation initiation is regulated in response to 
nutrient availability and mitogenic stimulation and is 
coupled with cell cycle progression and cell growth. 
Several alterations in translational control occur in 
cancer. Variant mRNA sequences can alter the 
translational efficiency of individual mRNA molecules, 
which in turn play a role in cancer biology. Changes in 
the expression or availability of components of the 
translational machinery and in the activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase in 
the overall rate of protein synthesis and translational 
activation of the mRNA molecules involved in cell 
growth and proliferation. We review the basic 
principles of translational control, the alterations 
encountered in cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avenues. 

Introduction 

The fundamental principle of molecular therapeutics in can-
cer is to exploit the differences in gene expression between
cancer cells and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences in gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences in transcription. Gene expression is quite complicated, 
however, and is also regulated at the level of mRNA stability, 
mRNA translation, and protein stability. 

The power of translational regulation has been best recog-
nized among developmental biologists, because transcription
does not occur in early embryogenesis in eukaryotes. For ex-
ample, in Xenopus, the period of transcriptional quiescence
continues until the embryo reaches midblastula transition, the
4000-cell stage. Therefore, all necessary mRNA molecules are
transcribed during oogenesis and stockpiled in a translationally
inactive, masked form. The mRNA are translationally activated
at appropriate times during oocyte maturation, fertilization, and
early embryogenesis and thus are under strict translational
control.

Translation has an established role in cell growth. Basi- 
cally, an increase in protein synthesis occurs as a conse- 
quence of mitogenesis. Until recently, however, little was 
known about the alterations in mRNA translation in cancer, 
and much is yet to be discovered about their role in the
development and progression of cancer. Here we review the
basic principles of translational control, the alterations en-
countered in cancer, and selected therapies targeting transla-
tion initiation to elucidate potential new therapeutic avenues.

Basic Principles of Translational Control 
Mechanism of Translation Initiation
Translation initiation is the main step in translational regulation.
Translation initiation is a complex process in which the initiator
tRNA and the 40S and 60S ribosomal subunits are recruited to
the 5' end of a mRNA molecule and assembled by eukaryotic
translation initiation factors into an 80S ribosome at the start
codon of the mRNA (Fig. 1). The 5' end of eukaryotic mRNA is
capped, i.e., contains the cap structure m7GpppN (7-methyl-
guanosine-triphospho-5'-ribonucleoside). Most translation in
eukaryotes occurs in a cap-dependent fashion, i.e., the cap is
specifically recognized by the eIF4E,3 which binds the 5' cap.
The eIF4F translation initiation complex is then formed by the
assembly of eIF4E, the RNA helicase eIF4A, and eIF4G, a
scaffolding protein that mediates the binding of the 40S ribo-
somal subunit to the mRNA molecule through interaction with
the eIF3 protein present on the 40S ribosome. eIF4A and eIF4B
participate in melting the secondary structure of the 5' UTR of
the mRNA. The 43S initiation complex (40S/eIF2/Met-tRNA/
GTP complex) scans the mRNA in a 5'→3' direction until it
encounters an AUG start codon. This start codon is then base-
paired to the anticodon of initiator tRNA, forming the 48S initi-
ation complex. The initiation factors are then displaced from the
48S complex, and the 60S ribosome joins to form the 80S
ribosome.

Unlike most eukaryotic translation, translation initiation of
certain mRNAs, such as the picornavirus RNA, is cap inde-
pendent and occurs by internal ribosome entry. This mecha-
nism does not require eIF4E. Either the 43S complex can bind
the initiation codon directly through interaction with the IRES in
the 5' UTR such as in the encephalomyocarditis virus, or it can
initially attach to the IRES and then reach the initiation codon by
scanning or transfer, as is the case with the poliovirus (1).

3 The abbreviations used are: eIF4E, eukaryotic initiation factor 4E; UTR,
untranslated region; IRES, internal ribosome entry site; 4E-BP1, eukaryotic
initiation factor 4E-binding protein 1; S6K, ribosomal p70 S6 kinase; mTOR,
mammalian target of rapamycin; ATM, ataxia telangiectasia mutated; PI3K,
phosphatidylinositol 3-kinase; PTEN, phosphatase and tensin homolog de-
leted from chromosome 10; PP2A, protein phosphatase 2A; TGF-β3, trans-
forming growth factor-β3; PAP, poly(A) polymerase; EPA, eicosapentaenoic
acid; mda-7, melanoma differentiation-associated gene 7.

Fig. 1. Translation initiation in eukaryotes. The 4E-BPs are hyperphos-
phorylated to release eIF4E so that it can interact with the 5' cap, and the
eIF4F initiation complex is assembled. The interaction of poly(A) binding
protein with the initiation complex and circularization of the mRNA is not
depicted in the diagram. The secondary structure of the 5' UTR is melted,
the 40S ribosomal subunit is bound to eIF3, and the ternary complex
consisting of eIF2, GTP, and the Met-tRNA is recruited to the mRNA. The
ribosome scans the mRNA in a 5'→3' direction until an AUG start codon
is found in the appropriate sequence context. The initiation factors are
released, and the large ribosomal subunit is recruited.

Regulation of Translation Initiation 
Translation initiation can be regulated by alterations in the 
expression or phosphorylation status of the various factors 
involved. Key components in translational regulation that
may provide potential therapeutic targets follow. 

eIF4E. eIF4E plays a central role in translation regulation.
It is the least abundant of the initiation factors and is con-
sidered the rate-limiting component for initiation of cap-
dependent translation. eIF4E may also be involved in mRNA
splicing, mRNA 3' processing, and mRNA nucleocytoplas-
mic transport (2). eIF4E expression can be increased at the
transcriptional level in response to serum or growth factors
(3). eIF4E overexpression may cause preferential translation
of mRNAs containing excessive secondary structure in their
5' UTR that are normally discriminated against by the trans-
lational machinery and thus are inefficiently translated (4-7).
As examples of this, overexpression of eIF4E promotes in-
creased translation of vascular endothelial growth factor,
fibroblast growth factor-2, and cyclin D1 (2, 8, 9).

Another mechanism of control is the regulation of eIF4E
phosphorylation. eIF4E phosphorylation is mediated by the
mitogen-activated protein kinase-interacting kinase 1, which
is activated by the mitogen-activated pathway, activating
extracellular signal-related kinases and the stress-activated
pathway acting through p38 mitogen-activated protein ki-
nase (10-13). Several mitogens, such as serum, platelet-
derived growth factor, epidermal growth factor, insulin,
angiotensin II, src kinase overexpression, and ras over-
expression, lead to eIF4E phosphorylation (14). The phos-
phorylation status of eIF4E is usually correlated with the
translational rate and growth status of the cell; however,
eIF4E phosphorylation has also been observed in response
to some cellular stresses when translational rates actually
decrease (15). Thus, further study is needed to understand
the effects of eIF4E phosphorylation on eIF4E activity.

Another mechanism of regulation is the alteration of eIF4E
availability by the binding of eIF4E to the eIF4E-binding pro-
teins (4E-BP, also known as PHAS-I). 4E-BPs compete with
eIF4G for a binding site in eIF4E. The binding of eIF4E to the
best characterized eIF4E-binding protein, 4E-BP1, is regu-
lated by 4E-BP1 phosphorylation. Hypophosphorylated 4E-
BP1 binds to eIF4E, whereas 4E-BP1 hyperphosphorylation
decreases this binding. Insulin, angiotensin, epidermal
growth factor, platelet-derived growth factor, hepatocyte
growth factor, nerve growth factor, insulin-like growth factors
I and II, interleukin 3, granulocyte-macrophage colony-stim-
ulating factor + steel factor, gastrin, and the adenovirus have
all been reported to induce phosphorylation of 4E-BP1 and
to decrease the ability of 4E-BP1 to bind eIF4E (15, 16).
Conversely, deprivation of nutrients or growth factors results
in 4E-BP1 dephosphorylation, an increase in eIF4E binding,
and a decrease in cap-dependent translation.

p70 S6 Kinase. Phosphorylation of ribosomal 40S protein
S6 by S6K is thought to play an important role in translational
regulation. S6K -/- mouse embryonic cells proliferate more
slowly than do parental cells, demonstrating that S6K has a
positive influence on cell proliferation (17). S6K regulates the
translation of a group of mRNAs possessing a 5' terminal
oligopyrimidine tract (5' TOP) found at the 5' UTR of ribosomal
protein mRNAs and other mRNAs coding for components of
the translational machinery. Phosphorylation of S6K is regu-
lated in part based on the availability of nutrients (18, 19) and is
stimulated by several growth factors, such as platelet-derived
growth factor and insulin-like growth factor I (20).

eIF2α Phosphorylation. The binding of the initiator tRNA
to the small ribosomal unit is mediated by translation initia-
tion factor eIF2. Phosphorylation of the α-subunit of eIF2
prevents formation of the eIF2/GTP/Met-tRNA complex and
inhibits global protein synthesis (21, 22). eIF2α is phospho-
rylated under a variety of conditions, such as viral infection,
nutrient deprivation, heme deprivation, and apoptosis (22).
eIF2α is phosphorylated by heme-regulated inhibitor, nutrient-
regulated protein kinase, and the IFN-induced, double-
stranded RNA-activated protein kinase (PKR; Ref. 23).
The mTOR Signaling Pathway. The macrolide antibiotic
rapamycin (Sirolimus; Wyeth-Ayerst Research, Collegeville,
PA) has been the subject of intensive study because it in-
hibits signal transduction pathways involved in T-cell activa-
tion. The rapamycin-sensitive component of these pathways
is mTOR (also called FRAP or RAFT1). mTOR is the mam-
malian homologue of the yeast TOR proteins that regulate G1
progression and translation in response to nutrient availabil-
ity (24). mTOR is a serine-threonine kinase that modulates
translation initiation by altering the phosphorylation status of
4E-BP1 and S6K (Fig. 2; Ref. 25).

4E-BP1 is phosphorylated on multiple residues. mTOR phos-
phorylates the Thr-37 and Thr-46 residues of 4E-BP1 in vitro
(26); however, phosphorylation at these sites is not associated
with a loss of eIF4E binding. Phosphorylation of Thr-37 and
Thr-46 is required for subsequent phosphorylation at several
COOH-terminal, serum-sensitive sites; a combination of these
phosphorylation events appears to be needed to inhibit the
binding of 4E-BP1 to eIF4E (25). The product of the ATM gene,
p38/MSK1 pathway, and protein kinase C also play a role in
4E-BP1 phosphorylation (27-29).

S6K and 4E-BP1 are also regulated, in part, by PI3K and its 
downstream protein kinase Akt. PTEN is a phosphatase that 
negatively regulates PI3K signaling. PTEN null cells have 
constitutively active Akt, with increased S6K activity and
S6 phosphorylation (30). S6K activity is inhibited both by
PI3K inhibitors wortmannin and LY294002 and by mTOR
inhibitor rapamycin (24). Akt phosphorylates Ser-2448 in
mTOR in vitro, and this site is phosphorylated upon Akt
activation in vivo (31-33). Thus, mTOR is regulated by the 
PI3K/Akt pathway; however, this does not appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR is controversial. 

Interestingly, mTOR autophosphorylation is blocked by wort-
mannin but not by rapamycin (34). This seeming inconsistency
suggests that mTOR-responsive regulation of 4E-BP1 and S6K
activity occurs through a mechanism other than intrinsic mTOR
kinase activity. An alternate pathway for 4E-BP1 and S6K phos-
phorylation by mTOR activity is by the inhibition of a phospha-
tase. Treatment with calyculin A, an inhibitor of phosphatases 1
and 2A, reduces rapamycin-induced dephosphorylation of 4E-
BP1 and S6K (35). PP2A interacts with full-length
S6K but not with a S6K mutant that is resistant to dephospho-
rylation resulting from rapamycin. mTOR phosphorylates PP2A
in vitro; however, how this process alters PP2A activity is not
known. These results are consistent with the model that phos-
phorylation of a phosphatase by mTOR prevents dephospho-
rylation of 4E-BP1 and S6K, and conversely, that nutrient dep-
rivation and rapamycin block inhibition of the phosphatase by
mTOR.

Polyadenylation. The poly(A) tail in eukaryotic mRNA is
important in enhancing translation initiation and mRNA sta-
bility. Polyadenylation plays a key role in regulating gene
expression during oogenesis and early embryogenesis.
Some mRNA that are translationally inactive in the oocyte are
polyadenylated concomitantly with translational activation in
oocyte maturation, whereas other mRNAs that are transla-
tionally active during oogenesis are deadenylated and trans-
lationally silenced (36-38). Thus, control of poly(A) tail syn-
thesis is an important regulatory step in gene expression.
The 5' cap and poly(A) tail are thought to function synergis-
tically to regulate mRNA translational efficiency (39, 40).

Fig. 2. Regulation of translation initiation by signal transduction path-
ways. Signaling via p38, extracellular signal-related kinase, PI3K, and
mTOR can all activate translation initiation.

RNA Packaging. Most RNA-binding proteins are assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translational fate of the transcript (41). A highly 
conserved family of Y-box proteins is found in cytoplasmic 
messenger ribonucleoprotein particles, where the proteins 
are thought to play a role in restricting the recruitment of 
mRNA to the translational machinery (41-43). The major
mRNA-associated protein, YB-1, destabilizes the interaction
of eIF4E and the 5' mRNA cap in vitro, and overexpression of
YB-1 results in translational repression in vivo (44). Thus, 
alterations in RNA packaging can also play an important role 
in translational regulation. 

Translation Alterations Encountered in Cancer 
Three main alterations at the translational level occur in cancer: 
variations in mRNA sequences that increase or decrease trans-
lational efficiency, changes in the expression or availability of
components of the translational machinery, and activation of
translation through aberrantly activated signal transduction
pathways. The first alteration affects the translation of an indi-
vidual mRNA that may play a role in carcinogenesis. The sec-
ond and third alterations can lead to more global changes, such
as an increase in the overall rate of protein synthesis, and the
translational activation of several mRNA species.

Variations in mRNA Sequence

Variations in mRNA sequence affect the translational effi-
ciency of the transcript. A brief description of these variations
and examples of each mechanism follow.

Mutations. Mutations in the mRNA sequence, especially
in the 5' UTR, can alter its translational efficiency, as seen in 
the following examples. 
c-myc. Saito et al. proposed that translation of full-length
c-myc is repressed, whereas in several Burkitt lymphomas
that have deletions of the mRNA 5' UTR, translation of c-myc
is more efficient (45). More recently, it was reported that the
5' UTR of c-myc contains an IRES, and thus c-myc transla-
tion can be initiated by a cap-independent as well as a
cap-dependent mechanism (46, 47). In patients with multiple
myeloma, a C→T mutation in the c-myc IRES was identified
(48) and found to cause an enhanced initiation of translation
via internal ribosomal entry (49).

BRCA1. A somatic point mutation (117 G→C) in position
-3 with respect to the start codon of the BRCA1 gene was
identified in a highly aggressive sporadic breast cancer (50).
Chimeric constructs consisting of the wild-type or mutated
BRCA1 5' UTR and a downstream luciferase reporter dem-
onstrated a decrease in the translational efficiency with the 5'
UTR mutation.

Cyclin-dependent Kinase Inhibitor 2A. Some inherited
melanoma kindreds have a G→T transversion at base -34
of cyclin-dependent kinase inhibitor-2A, which encodes a
cyclin-dependent kinase 4/cyclin-dependent kinase 6 kinase
inhibitor important in G1 checkpoint regulation (51). This
mutation gives rise to a novel AUG translation initiation
codon, creating an upstream open reading frame that com-
petes for scanning ribosomes and decreases translation
from the wild-type AUG.
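
The effect of such a mutation can be pictured with a deliberately simplified scanning model in which the ribosome initiates at the first AUG it meets while moving 5'→3'. The Python sketch below is only an illustration under that assumption: it ignores sequence context and reinitiation, the function names are hypothetical, and the sequences are invented rather than taken from the real cyclin-dependent kinase inhibitor 2A leader.

```python
def first_start_codon(mrna):
    """Return the 0-based index of the first AUG met scanning 5'->3', or -1."""
    return mrna.upper().find("AUG")


def substitute(mrna, position, base):
    """Return a copy of `mrna` with the base at 0-based `position` replaced."""
    return mrna[:position] + base + mrna[position + 1:]


# Invented 5' leader with no AUG, followed by the authentic start codon.
leader = "GGCAGGCACCGC"
coding = "AUGGAGCCG"
wild_type = leader + coding

# A single G->U change in the leader (G->T in the DNA) creates a new AUG
# upstream of the authentic start codon.
mutant = substitute(wild_type, 4, "U")

wild_type_start = first_start_codon(wild_type)  # authentic AUG
mutant_start = first_start_codon(mutant)        # novel upstream AUG
```

In this toy model every scanning ribosome stops at the upstream AUG in the mutant. In the situation described above the upstream open reading frame only competes for ribosomes, so translation from the wild-type AUG is reduced rather than abolished.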

Alternate Splicing and Alternate Transcription Start 
Sites. Alterations in splicing and alternate transcription sites 
can lead to variations in 5' UTR sequence, length, and second- 
ary structure, ultimately impacting translational efficiency. 

ATM. The ATM gene has four noncoding exons in its 5' 
UTR that undergo extensive alternative splicing (52). The 
contents of 12 different 5' UTRs that show considerable
diversity in length and sequence have been identified. These 
divergent 5' leader sequences play an important role in the 
translational regulation of the ATM gene. 

mdm2. In a subset of tumors, overexpression of the onco-
protein mdm2 results in enhanced translation of the mdm2
mRNA. Use of different promoters leads to two mdm2 tran-
scripts that differ only in their 5' leaders (53). The longer 5'
UTR contains two upstream open reading frames, and this
mRNA is loaded with ribosomes inefficiently compared with
the short 5' UTR.

BRCA1. In a normal mammary gland, BRCA1 mRNA is
expressed with a shorter leader sequence (5' UTRa), whereas
in sporadic breast cancer tissue, BRCA1 mRNA is expressed
with a longer leader sequence (5' UTRb); the translational
efficiency of transcripts containing 5' UTRb is 10 times lower
than that of transcripts containing 5' UTRa (54).

TGF-β3. TGF-β3 mRNA includes a 1.1-kb 5' UTR, which
exerts an inhibitory effect on translation. Many human breast
cancer cell lines contain a novel TGF-β3 transcript with a 5'
UTR that is 870 nucleotides shorter and has a 7-fold greater
translational efficiency than the normal TGF-β3 mRNA (55).

Alternate Polyadenylation Sites. Multiple polyadenyl-
ation signals leading to the generation of several transcripts
with differing 3' UTR have been described for several mRNA
species, such as the RET proto-oncogene (56), ATM gene
(52), tissue inhibitor of metalloproteinases-3 (57), RHOA
proto-oncogene (58), and calmodulin-I (59). Although the
effect of these alternate 3' UTRs on translation is not yet
known, they may be important in RNA-protein interactions
that affect translational recruitment. The role of these alter-
ations in cancer development and progression is unknown.



Alterations in the Components of the 
Translation Machinery 

Alterations in the components of translation machinery can 
take many forms. 

Overexpression of eIF4E. Overexpression of eIF4E
causes malignant transformation in rodent cells (60) and the
deregulation of HeLa cell growth (61). Polunovsky et al. (62)
found that eIF4E overexpression substitutes for serum and
individual growth factors in preserving viability of fibroblasts,
which suggests that eIF4E can mediate both proliferative and
survival signaling.

Elevated levels of eIF4E mRNA have been found in a broad
spectrum of transformed cell lines (63). eIF4E levels are
elevated in all ductal carcinoma in situ specimens and inva-
sive ductal carcinomas, compared with benign breast spec-
imens evaluated with Western blot analysis (64, 65). Prelim-
inary studies suggest that this overexpression is attributable
to gene amplification (66).

There are accumulating data suggesting that eIF4E overex-
pression can be valuable as a prognostic marker. eIF4E over-
expression was found in a retrospective study to be a marker of
poor prognosis in stages I to III breast carcinoma (67). Verifica-
tion of the prognostic value of eIF4E in breast cancer is now
under way in a prospective trial (67). However, in a different
study, eIF4E expression was correlated with the aggressive
behavior of non-Hodgkin's lymphomas (68). In a prospective
analysis of patients with head and neck cancer, elevated levels
of eIF4E in histologically tumor-free surgical margins predicted
a significantly increased risk of local-regional recurrence (9).
These results all suggest that eIF4E overexpression can be
used to select patients who might benefit from more aggressive
systemic therapy. Furthermore, the head and neck cancer data
suggest that eIF4E overexpression is a field defect and can be
used to guide local therapy.

Alterations in Other Initiation Factors. Alterations in a
number of other initiation factors have been associated with
cancer. Overproduction of eIF4G, similar to eIF4E, leads to
malignant transformation in vitro (69). eIF-2α is found in
increased levels in bronchioloalveolar carcinomas of the lung
(3). Initiation factor eIF-4A1 is overexpressed in melanoma
(70) and hepatocellular carcinoma (71). The p40 subunit of
translation initiation factor 3 is amplified and overexpressed
in breast and prostate cancer (72), and the eIF3-p110 subunit
is overexpressed in testicular seminoma (73). The role that
overexpression of these initiation factors plays on the devel-
opment and progression of cancer, if any, is not known.

Overexpression of S6K. S6K is amplified and highly 
overexpressed in the MCF7 breast cancer cell line, com- 
pared with normal mammary epithelium (74). In a study by 
Barlund et al. (74), S6K was amplified in 53 of 668 primary
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
Overexpression of PAP. PAP catalyzes 3' poly(A) syn-
thesis. PAP is overexpressed in human cancer cells com-
pared with normal and virally transformed cells (75). PAP
enzymatic activity in breast tumors has been correlated with 
PAP protein levels (76) and, in mammary tumor cytosols, was 
found to be an independent factor for predicting survival (76). 
Little is known, however, about how PAP expression or ac- 
tivity affects the translational profile. 

Alterations in RNA-binding Proteins. Even less is known 
about alterations in RNA packaging in cancer. Increased ex- 
pression and nuclear localization of the RNA-binding protein 
YB-1 are indicators of a poor prognosis for breast cancer (77), 
non-small cell lung cancer (78), and ovarian cancer (79). How- 
ever, this effect may be mediated at least in part at the level of 
transcription, because YB-1 increases chemoresistance by en- 
hancing the transcription of a multidrug resistance gene (80). 



Activation of Signal Transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor genes or overexpression of certain tyrosine kinases 
can contribute to the growth and aggressiveness of tumors. An 
important mutation in human cancers is that of the tumor suppressor 
gene PTEN, which leads to the activation of the PI3K/Akt path- 
way. Activation of PI3K and Akt induces the oncogenic trans- 
formation of chicken embryo fibroblasts. The transformed cells 
show constitutive phosphorylation of S6K and of 4E-BP1 (81). 
A mutant Akt that retains kinase activity but does not phos- 
phorylate S6K or 4E-BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of PI3K and 
Akt and the phosphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, insulin-like growth factor, HER2/neu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to alter translation initiation, activation 
of translation is likely to contribute to the growth and aggres- 
siveness of these tumors. Furthermore, the mRNAs for many 
of these kinases are themselves under translational control. 
For example, HER2/neu mRNA is translationally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation in a cell type-independent manner and 
by a distinct cell type-dependent mechanism that increases 
translational efficiency (82). HER2/neu translation is different 
in transformed and normal cells. Thus, it is possible that 
alterations at the translational level can in part account for 
the discrepancy between HER2/neu gene amplification de- 
tected by fluorescence in situ hybridization and protein levels 
detected by immunohistochemical assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
ways involved in the activation of translation initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycin 
and Tumstatin 

Rapamycin inhibits the proliferation of lymphocytes. It was 
initially developed as an immunosuppressive drug for organ 
transplantation. Rapamycin with FKBP12 (FK506-binding 
protein, Mr 12,000) binds to mTOR to inhibit its function. 

Rapamycin causes a small but significant reduction in the 
initiation rate of protein synthesis (83). It blocks cell growth in 
part by blocking S6 phosphorylation and selectively sup- 
pressing the translation of 5' TOP mRNAs, such as ribosomal 
proteins, and elongation factors (83-85). Rapamycin also 
blocks 4E-BP1 phosphorylation and inhibits cap-dependent 
but not cap-independent translation (17, 86). 

The rapamycin-sensitive signal transduction pathway, acti- 
vated during malignant transformation and cancer progression, 
is now being studied as a target for cancer therapy (87). Pros- 
tate, breast, small cell lung, glioblastoma, melanoma, and T-cell 
leukemia are among the cancer lines most sensitive to the 
rapamycin analogue CCI-779 (Wyeth-Ayerst Research; Ref. 
87). In rhabdomyosarcoma cell lines, rapamycin is either cyto- 
static or cytocidal, depending on the p53 status of the cell; p53 
wild-type cells treated with rapamycin arrest in the G1 phase 
and maintain their viability, whereas p53 mutant cells accumu- 
late in G1 and undergo apoptosis (88, 89). In a recently reported 
study using human primitive neuroectodermal tumor and 
medulloblastoma models, rapamycin exhibited more cytotox- 
icity in combination with cisplatin and camptothecin than as a 
single agent. In vivo, CCI-779 delayed growth of xenografts by 
160% after 1 week of therapy and 240% after 2 weeks. A single 
high-dose administration caused a 37% decrease in tumor 
volume. Growth inhibition in vivo was 1.3 times greater with 
cisplatin in combination with CCI-779 than with cisplatin alone 
(90). Thus, preclinical studies suggest that rapamycin ana- 
logues are useful as single agents and in combination with 
chemotherapy. 

Rapamycin analogues CCI-779 and RAD001 (Novartis, 
Basel, Switzerland) are now in clinical trials. Because of the 
known effect of rapamycin on lymphocyte proliferation, a 
potential problem with rapamycin analogues is immunosup- 
pression. However, although prolonged immunosuppression 
can result from rapamycin and CCI-779 administered on 
continuous-dose schedules, the immunosuppressive effects 
of rapamycin analogues resolve in ~24 h after therapy 
(91). The principal toxicities of CCI-779 have included der- 
matological toxicity, myelosuppression, infection, mucositis, 
diarrhea, reversible elevations in liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCI-779 have been conducted in 
advanced renal cell carcinoma and in stage III/IV breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported in abstract form, although there were no 
complete responses, partial responses were documented in 
both renal cell carcinoma and in breast carcinoma (94, 95). 
Thus, CCI-779 has documented preliminary clinical activity in 
a previously treated, unselected patient population. 

Active investigation is under way into patient selection for 
mTOR inhibitors. Several studies have found an enhanced 
efficacy of CCI-779 in PTEN-null tumors (30, 96). Another 
study found that six of eight breast cancer cell lines were 
responsive to CCI-779, although only two of these lines 
lacked PTEN (97). There was, however, a positive correlation 
between Akt activation and CCI-779 sensitivity (97). This 
correlation suggests that activation of the PI3K-Akt pathway, 
regardless of whether it is attributable to a PTEN mutation or 
to overexpression of receptor tyrosine kinases, makes can- 
cer cells amenable to mTOR-directed therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycin resistance; thus, a lower 4E-BP1/eIF4E ratio 
may predict rapamycin resistance (98). 

Another mode of activity for rapamycin and its analogues 
appears to be through inhibition of angiogenesis. This activ- 
ity may be both through direct inhibition of endothelial cell 
proliferation as a result of mTOR inhibition in these cells and by 
inhibition of translation of such proangiogenic factors as 
vascular endothelial growth factor in tumor cells (99, 100). 

The angiogenesis inhibitor tumstatin, another anticancer 
drug currently under study, was also found recently to inhibit 
translation in endothelial cells (101). Through a requisite in- 
teraction with integrin, tumstatin inhibits activation of the 
PI3K/Akt pathway and mTOR in endothelial cells and pre- 
vents dissociation of eIF4E from 4E-BP1, thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR-signaling pathway. 



Targeting eIF2α: EPA, Clotrimazole, mda-7, 
and Flavonoids 

EPA is an n-3 polyunsaturated fatty acid found in the fish- 
based diets of populations having a low incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as in animal models (104, 105). It blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca2+ from 
intracellular stores while inhibiting their refilling, thereby ac- 
tivating PKR. PKR, in turn, phosphorylates and inhibits eIF2α, 
resulting in the inhibition of protein synthesis at the level of 
translation initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent in vitro and in vivo, inhibits cell growth through 
depletion of Ca2+ stores, activation of PKR, and phospho- 
rylation of eIF2α (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cyclins A, E, and D1, 
resulting in blockage of the cell cycle in G1. 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda-7 (Ad- 
mda7) induces apoptosis in many cancer cells including 
breast, colorectal, and lung cancer (107-109). Ad-mda7 also 
induces and activates PKR, which leads to phosphorylation 
of eIF2α and induction of apoptosis (110). 

Flavonoids such as genistein and quercetin suppress tu- 
mor cell growth. All three mammalian eIF2α kinases, PKR, 
heme-regulated inhibitor, and PERK/PEK, are activated by 
flavonoids, with phosphorylation of eIF2α and inhibition of 
protein synthesis (111). 

Targeting eIF4A and eIF4E: Antisense RNA 
and Peptides 

Antisense expression of eIF4A decreases the proliferation rate 
of melanoma cells (112). Sequestration of eIF4E by overexpres- 
sion of 4E-BP1 is proapoptotic and decreases tumorigenicity 
(113, 114). Reduction of eIF4E with antisense RNA decreases 
soft agar growth, increases tumor latency, and increases the 
rates of tumor doubling times (7). Antisense eIF4E RNA treat- 
ment also reduces the expression of angiogenic factors (115) 
and has been proposed as a potential adjuvant therapy for head 
and neck cancers, particularly when elevated eIF4E is found in 
surgical margins. Small molecule inhibitors that bind the eIF4G/ 
4E-BP1-binding domain of eIF4E are proapoptotic (116) and 
are also being actively pursued. 

Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent translation in cancer cells is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5' UTR. These mRNAs would thus be at a competitive 
disadvantage in normal cells and not translate well, whereas in 
cancer cells, they would translate more efficiently. For example, 
the introduction of the 5' UTR of fibroblast growth factor-2 5' to 
the coding sequence of herpes simplex virus type-1 thymidine 
kinase gene, allows for selective translation of herpes simplex 
virus type-1 thymidine kinase gene in breast cancer cell lines 
compared with normal mammary cell lines and results in se- 
lective sensitivity to ganciclovir (117). 



Toward the Future 

Translation is a crucial process in every cell. However, several 
alterations in translational control occur in cancer. Cancer cells 
appear to need an aberrantly activated translational state for 
survival, thus allowing the targeting of translation initiation with 
surprisingly low toxicity. Components of the translational ma- 
chinery, such as eIF4E, and signal transduction pathways in- 
volved in translation initiation, such as mTOR, represent promising 
targets for cancer therapy. Inhibitors of mTOR have already 
shown some preliminary activity in clinical trials. It is possible 
that with the development of better predictive markers and 
better patient selection, response rates to single-agent therapy 
can be improved. Similar to other cytostatic agents, however, 
mTOR inhibitors are most likely to achieve clinical utility in 
combination therapy. In the interim, our increasing understand- 
ing of translation initiation and signal transduction pathways 
promises to lead to the identification of new therapeutic targets 
in the near future. 

Correlative immunohistochemical and reverse transcriptase polymerase 
chain reaction analysis of somatostatin receptor type 2 in neuroendocrine 
tumors of the lung. 

Papotti M, Croce S, Macri L, Funaro A, Pecchioni C, Schindler M, Bussolati G. 

Department of Biomedical Sciences and Oncology, University of Turin, Italy. 

Somatostatin receptors type 2 (sst2) have been frequently detected in neuroendocrine 
tumors and bind somatostatin analogues, such as octreotide, with high affinity. Receptor 
autoradiography, specific mRNA detection and, more recently, anti-sst2 polyclonal 
antibodies are currently employed to reveal sst2. The aim of the present study was to 
investigate by three different techniques the presence of sst2 in a series of 26 
neuroendocrine tumors of the lung in which fresh frozen tissue and paraffin sections were 
available. It was possible, therefore, to compare, in individual cases, RNA analysis 
studied by reverse transcriptase polymerase chain reaction (RT-PCR), in situ 
hybridization (ISH), and immunohistochemistry. A series of 20 nonneuroendocrine lung 
carcinoma samples served as controls. RT-PCR was positive for sst2 in 22 of 26 samples, 
including 15 of 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 small-cell 
carcinomas. The sst2 mRNA signal obtained by RT-PCR was strong in the majority 
(87%) of typical carcinoids and of variable intensity in atypical carcinoids and small-cell 
carcinomas. A weakly positive signal was observed in 5 of 20 control samples. In 
immunohistochemistry, two different antibodies (anti-sst2) were employed, including a 
monoclonal antibody, generated in the Department of Pathology, University of Turin. In 
the majority of samples a good correlation between sst2 mRNA (as detected by RT-PCR) 
and sst2 protein expression (as detected by immunohistochemistry) was observed. 
However, one atypical carcinoid and one small-cell carcinoma had focal immunostaining 
but no RT-PCR signal. ISH performed in selected samples paralleled the results obtained 
with the other techniques. A low sst2 expression was associated with high grade 
neuroendocrine tumors and with aggressive behavior. It is concluded that 1) 
neuroendocrine tumors of the lung express sst2, and there is a correlation between the 
mRNA amount and the degree of differentiation; 2) immunohistochemistry and ISH are 
reliable tools to demonstrate sst2 in these tumors; and 3) sst2 identification in tissue 
sections may provide information on the diagnostic or therapeutic usefulness of 
somatostatin analogues in individual patients with neuroendocrine tumors. 
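
The RT-PCR tallies reported in this abstract can be cross-checked with a few lines of arithmetic. The following is a minimal sketch in Python, not part of the original study; the variable names are illustrative, and only the counts stated in the abstract are used.

```python
# RT-PCR results for sst2 as stated in the abstract: (positive, total) per tumor type.
# The dictionary name and keys are illustrative labels, not identifiers from the study.
rt_pcr_sst2 = {
    "typical carcinoid": (15, 15),
    "atypical carcinoid": (5, 6),
    "small-cell carcinoma": (2, 5),
}

# Sum the subgroup counts to recover the overall figure.
positive = sum(p for p, _ in rt_pcr_sst2.values())
total = sum(t for _, t in rt_pcr_sst2.values())

# The subgroups should add up to the reported 22 of 26 samples.
assert (positive, total) == (22, 26)

# Print the positivity rate per tumor type and overall.
for tumor_type, (p, t) in rt_pcr_sst2.items():
    print(f"{tumor_type}: {p}/{t} positive ({100 * p / t:.0f}%)")
print(f"all neuroendocrine tumors: {positive}/{total} positive")
```

The subgroup counts are consistent with the overall figure of 22 positive samples out of 26.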

Expression of somatostatin receptor types 1-5 in 81 cases of 
gastrointestinal and pancreatic endocrine tumors. A correlative 
immunohistochemical and reverse-transcriptase polymerase chain 
reaction analysis. 

Papotti M, Bongiovanni M, Volante M, Allia E, Landolfi S, Helboe L, Schindler M, 
Cole SL, Bussolati G. 

Department of Biomedical Sciences and Oncology, University of Turin, Via Santena 7, 
10126 Turin, Italy, mauro.papotti@unito.it 

Somatostatin receptors (SSTRs) have been extensively mapped in human tumors by 
means of autoradiography, reverse-transcriptase polymerase chain reaction (RT-PCR), in 
situ hybridization (ISH) and immunohistochemistry (IHC). We analyzed the SSTR type 
1-5 expression by means of RT-PCR and/or IHC in a series of 81 functioning and non- 
functioning gastroenteropancreatic (GEP) endocrine tumors and related normal tissues. 
Moreover, we compared the results with clinical, pathological and hormonal features. 
Forty-six cases (13 intestinal and 33 pancreatic) were studied for SSTR 1-5 expression 
using RT-PCR, IHC with antibodies to SSTR types 2, 3, 5 and ISH for SSTR2 mRNA. 
The vast majority of tumors expressed SSTR types 1, 2, 3 and 5, while SSTR4 was 
detected in a small minority. Due to the good correlation between RT-PCR and IHC data 
on SSTR types 2, 3, and 5, thirty-five additional GEP endocrine tumors were studied with 
IHC alone. Pancreatic insulinomas had an heterogeneous SSTR expression, while 100% 
of somatostatinomas expressed SSTR5 and 100% gastrinomas and glucagonomas 
expressed SSTR2. Pre-operative biopsy material showed an overlapping 
immunoreactivity with that of surgical specimens, suggesting that the SSTR status can be 
detected in the diagnostic work-up. It is concluded that SSTRs 1-5 are heterogeneously 
expressed in GEP endocrine tumors and that IHC is a reliable tool to detect SSTR types 
2, 3 and 5 in surgical and biopsy specimens. 

transformation of cervical cancer, 

Rudlowski C, Becker AJ, Schroder W, Rath W, Buttner R, Moser M. 

Dept of Gynecology and Obstetrics, University Hospital Heidelberg, Vossstr 7-9, D- 
69115 Heidelberg, Germany. 

We studied whether induction of glucose transporters (GLUTs) 1 to 4 correlates with 
human papillomavirus (HPV)-dependent malignant transformation of cervical epithelium. 
Tissue samples of cervical intraepithelial neoplasia (CIN; grades 1 to 3), invasive 
carcinomas, and lymph node metastasis were examined. HPV typing was performed. 
Tissue sections were immunostained with GLUT1 to GLUT4 antibodies. Messenger 
RNA (mRNA) in situ hybridization confirmed GLUT1 protein expression. Weak 
expression of GLUT1 was found in nondysplastic HPV-positive and HPV-negative 
epithelium; significant expression was observed in preneoplastic lesions, correlating with 
the degree of dysplasia. In CIN 3 high-risk HPV lesions, cervical cancer, and metastasis, 
GLUT1 was expressed at highest levels with a strong correlation of GLUT1 mRNA and 
protein expression. Immunostains for GLUT2 to GLUT4 were negative. Cervical tumor 
cells respond to enhanced glucose utilization by up-regulation of GLUT1. The strong 
induction of GLUT1 mRNA and protein in HPV-positive CIN 3 lesions suggests GLUT1 
overexpression as an early event in cervical neoplasia. GLUT1 is potentially relevant as a 
diagnostic tool and glucose metabolism as a therapeutic target in cervical cancer. 

Abstract Somatostatin receptors (SSTRs) have been 
extensively mapped in human tumors by means of auto- 
radiography, reverse-transcriptase polymerase chain reac- 
tion (RT-PCR), in situ hybridization (ISH) and immuno- 
histochemistry (IHC). We analyzed the SSTR type 1-5 
expression by means of RT-PCR and/or IHC in a series 
of 81 functioning and non-functioning gastroenteropan- 
creatic (GEP) endocrine tumors and related normal tis- 
sues. Moreover, we compared the results with clinical, 
pathological and hormonal features. Forty-six cases (13 
intestinal and 33 pancreatic) were studied for SSTR 1-5 
expression using RT-PCR, IHC with antibodies to SSTR 
types 2, 3, 5 and ISH for SSTR2 mRNA. The vast majori- 
ty of tumors expressed SSTR types 1, 2, 3 and 5, while 
SSTR4 was detected in a small minority. Due to the good 
correlation between RT-PCR and IHC data on SSTR 
types 2, 3, and 5, thirty-five additional GEP endocrine tu- 
mors were studied with IHC alone. Pancreatic insulino- 
mas had an heterogeneous SSTR expression, while 100% 
of somatostatinomas expressed SSTR5 and 100% gas- 
trinomas and glucagonomas expressed SSTR2. Pre-oper- 
ative biopsy material showed an overlapping immunore- 
activity with that of surgical specimens, suggesting that 
the SSTR status can be detected in the diagnostic work- 
up. It is concluded that SSTRs 1-5 are heterogeneously 
expressed in GEP endocrine tumors and that IHC is a re- 
liable tool to detect SSTR types 2, 3 and 5 in surgical and 
biopsy specimens. 

Keywords GEP • Somatostatin receptors • Pancreatic 
islets • Neuroendocrine tumor • Immunohistochemistry • 
RT-PCR 



Introduction 

The role of somatostatin analogues in the clinical setting 
is well established: long-acting analogues have been 
used for more than 10 years to treat endocrine tumors, 
and octreotide scintigraphy is widely used for tumor 
localization and even for radio-guided surgery [5, 9, 15, 
16, 19, 22, 45]. The rationale behind these clinical appli- 
cations is based on the widespread occurrence of so- 
matostatin receptors (here abbreviated SSTR - according 
to the GenBank nomenclature) in human tumors, with 
special reference to endocrine tumors [42]. To date, five 
SSTR subtypes have been identified (types 1-5) [23, 27, 
50, 51] with different affinities for the natural ligand 
somatostatin as well as for synthetic analogues. SSTRs 
have been extensively mapped in human endocrine and 
non-endocrine tumors by means of autoradiography, 
Northern blot, reverse-transcriptase polymerase chain 
reaction (RT-PCR) and in situ hybridization (ISH) [4, 10, 
14, 20, 24, 29, 30, 31, 32, 33, 34, 39, 44]. Irrespective of 
the original location of the tumor, most studies demon- 
strated a high incidence of SSTR1 and SSTR2 (and to a 
minor degree of the other types). 

In recent years, several authors have developed anti- 
bodies for immunohistochemical demonstration of SSTR 
subtypes [8, 13, 17, 35, 40, 41, 43]. Such polyclonal an- 
tibodies, raised against peptide fragments of all five 
known SSTR subtypes, were mostly used for the immu- 
nohistochemical demonstration of SSTR in brain regions 
or in normal human or animal tissues. RT-PCR has the 
great advantage of a high sensitivity, but does not allow 
the precise SSTR cellular localization, which can be ob- 
tained by parallel ISH or immunohistochemical investi- 
gations [17, 25, 32, 33, 43]. 

In gastro-entero-pancreatic (GEP) endocrine tumors, 
SSTRs were analyzed using various procedures, including 
immunohistochemistry (IHC) for SSTR2. Expression of 
this receptor type in a variable percentage of pancreatic tu- 
mors and intestinal carcinoids, and a correlation between 
presence of SSTRs and scintigraphic localization and/or 
response to octreotide therapy were reported [13, 19, 22]. 
However, individual methods to reveal SSTRs were com- 
pared and correlated in a limited case series of GEP endo- 
crine tumors, and immunohistochemical detection was re- 
stricted to SSTR2A analysis only. An exception was the 
large studies of the groups of Jais and Wulbrand [12, 49], 
who investigated all five SSTR subtypes using the highly 
sensitive RT-PCR technique and found high levels of 
SSTR1, 2 and 5 mRNA expression and low amounts of 
SSTR3 and SSTR4 mRNA. However, no parallel data on 
SSTR cellular localization were presented. 

The aim of this study was to analyze the expression of 
SSTRs 1-5 by means of RT-PCR and/or IHC in a large 
series of 81 functioning and non-functioning GEP endo- 
crine tumors and to compare the results with clinical, 
pathological and hormonal features. Specific antibodies 
to SSTRs 2, 3 and 5 (which are the subtypes with the 
highest affinity for octreotide) were used in parallel with 
the analysis of mRNA expression levels. A heterogene- 
ous distribution of SSTR in both pancreatic and intesti- 
nal endocrine tumors was observed and the immunohis- 
tochemical SSTR detection was found to be a reliable di- 
agnostic tool in both surgical and pre-operative biopsy 
specimens, relative to mRNA analysis. 



Materials and methods 

Case selection 

Forty-six GEP endocrine tumors, in which fresh frozen material 
was available for mRNA analysis (13 and 33 tumors of gastroin- 
testinal and pancreatic origin, respectively), were collected from 
the pathology files of the University of Turin. These tumors were 
studied for SSTR expression by means of RT-PCR and IHC. Since 
the results of the two procedures showed a good correlation (see 
below), the study was expanded to 35 additional GEP neoplasms 
in which formalin-fixed and paraffin-embedded tissue blocks were 
available for immunohistochemical staining only. The reason for 
entering additional cases was to investigate SSTR expression in all 
types of hormone-producing pancreatic tumors and to analyze a 
series of aggressive (metastatic) intestinal carcinoids to compare 
SSTR expression with the tumor stage. Overall, 81 cases were 
entered in the present study, including 28 gastrointestinal and 53 
pancreatic endocrine tumors. 
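
The case numbers given in this paragraph can be reconciled with a short calculation. The sketch below is illustrative only (Python, with variable names chosen here rather than taken from the study). It also derives the site breakdown of the 35 cases studied with IHC alone, which the text does not state explicitly and which follows only if the overall totals are the sum of the two series.

```python
# Case counts as stated in the text; all names are illustrative.
frozen = {"gastrointestinal": 13, "pancreatic": 33}   # studied by RT-PCR and IHC
overall = {"gastrointestinal": 28, "pancreatic": 53}  # all cases entered in the study
additional_total = 35                                  # studied by IHC only

# Totals reported in the text.
assert sum(frozen.values()) == 46
assert sum(overall.values()) == 81
assert sum(frozen.values()) + additional_total == sum(overall.values())

# Derived (not stated in the text): site breakdown of the IHC-only cases.
additional = {site: overall[site] - frozen[site] for site in overall}
assert sum(additional.values()) == additional_total
print(additional)
```

Under that assumption, the IHC-only series would comprise 15 gastrointestinal and 20 pancreatic tumors.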

In four cases, the frozen material was represented by liver or 
lymph-node metastatic tissue, in the absence of frozen tissue from 
the primary tumor. In seven additional cases, both primary tumors 
and the corresponding metastatic deposits were analyzed in paral- 
lel using IHC. Finally, in four cases, the pre-operative cytological 
or biopsy material was also retrieved and studied in parallel with 
the corresponding surgical sample. 



Four carcinoids of the gastric body were associated with 
chronic atrophic gastritis and neuroendocrine cell hyperplasia [2]. 
In these cases, also peritumoral normal and hyperplastic neuroen- 
docrine cells were investigated for SSTR expression. 

All cases were reviewed on conventional hematoxylin and 
eosin-stained slides and on slides immunostained for endocrine 
markers, and classified following the recent World Health Organi- 
zation (WHO) classification of endocrine tumors [46]. Clinico- 
pathological data including sex, age, tumor location and size, hor- 
monal status and lymph-node or distant metastases were obtained 
for all patients. Follow-up data were available for 54 patients. 



Normal pancreas 

Although this study was not specifically designed to study SSTR 
distribution in normal pancreatic islets, peritumoral pancreatic tis- 
sue, when present, as well as two control cases of non-neoplastic 
pancreatic tissue, were also investigated to compare the immuno- 
histochemical reactivity of the currently employed SSTR anti- 
bodies with that reported in the literature. 

RT-PCR for SSTRs 1-5 

Total RNA was extracted from a frozen tumor fragment adjacent 
to that used for conventional histopathological diagnosis using the 
Tri Reagent (Molecular Research Center Inc., Cincinnati, Ohio) 
extraction kit following manufacturer's recommendations. To 
avoid contamination with normal peritumoral pancreatic or lym- 
phatic tissue, frozen sections were rapidly stained with conven- 
tional hematoxylin and eosin, and the normal tissue possibly pres- 
ent was removed with a surgical blade. The concentration of RNA 
was estimated by means of spectrophotometry, and RNA degrada- 
tion assessed using 1% agarose gel electrophoresis. Total RNA 
(1 µg) was first digested with 10 U RNase-free DNase (Boehringer, 
Mannheim, Germany) in a 10-µl solution containing 2 mM MgCl2 
to avoid DNA contamination. The solution was kept at room tem- 
perature (RT) for 10 min, then heated for 5 min at 70°C to inacti- 
vate DNase; 40 pM of oligodeoxythymidine primer (oligo dT16) 
was added and the solution heated at 70°C for 10 min, then chilled 
on ice to allow primer hybridization. The final solution was re- 
verse transcribed using 100 U superscript reverse transcriptase 
(Gibco BRL, Gaithersburg, Md.), and complementary DNA 
(cDNA) was generated in a 50-µl final reaction volume containing 
50 mM Tris-HCl pH 8.3, 75 mM KCl, 3 mM MgCl2, 10 mM DTT, 
1 mM dNTPs and 20 U RNasin (Promega, Madison, Wis.). The 
solution was incubated at 37°C for 90 min, then the enzymes inac- 
tivated by heating at 70°C for 10 min. Omission of RT enzyme 
served as a negative control for further PCR amplification reac- 
tions. RNA integrity was determined by performing a PCR reac- 
tion for β2-microglobulin [7]. Only cases positive for β2-micro- 
globulin mRNA amplification were considered informative for 
SSTR expression analysis. 

PCR experiments were carried out in a 10-µl final reaction vol- 
ume containing 1 µl cDNA template, 1 pM sense and antisense 
oligonucleotide primers, 10 mM Tris-HCl pH 8.3, 50 mM KCl, 
200 nM deoxynucleotide triphosphates (dNTPs), 1 mM MgCl2 
and 0.5 U Taq polymerase (GeneAmp, Perkin Elmer, Roche, 
Branchburg, N.J.). The primers used for RT-PCR and the corre- 
sponding specific annealing temperatures are reported in Table 1. 
β2-microglobulin and SSTR2 PCR reactions were performed un- 
der the following conditions: 35 cycles, each cycle consisting of 
denaturation at 94°C for 1 min, annealing at specific temperatures 
for 1 min and extension at 72°C for 2 min. PCR conditions for 
SSTR3 (35 cycles) and SSTR5 (40 cycles) were as follows: dena- 
turation at 94°C for 45 s, annealing at specific temperatures for 
1 min and extension at 72°C for 1 min. SSTR1 and SSTR4 PCR 
reactions were performed under the following conditions: 40 cy- 
cles, each cycle consisting of denaturation at 94°C for 1 min, an- 
nealing at specific temperatures for 30 s and extension at 72°C for 
75 s. PCR products were visualized under UV light in 1% agarose 
gel containing 0.5 µg/ml ethidium bromide. Omission of cDNA in 
the PCR mixture served as negative control. The RNA extracted 
from an H716 neuroendocrine colon carcinoma cell line known to 
express SSTR served as a positive control for SSTR amplification. 
To confirm the specificity of SSTR PCR reactions, PCR products 
were agarose-gel purified (Concert Gel Extraction Systems, Gibco 
BRL, Life Technologies, Milan, Italy) and ethanol precipitated, 
then subjected to nucleic acid direct sequencing using a fluores- 
cence-based sequencing kit (Applied Biosystems, Weiterstadt, 
Germany); electrophoresis and data analysis were carried out in an 
automated DNA sequencer ABI PRISM 377 (Applied Biosys- 
tems), following manufacturer's recommendations. The RT-PCR 
was not quantitative. Nevertheless, we used the same controls as 
those used in a former study on SSTR2 mRNA expression in neu- 
roblastomas using quantitative RT-PCR [44]. Control tissues 
found in the previous tests to express a number of SSTR2 mRNA 
molecules lower than 10^5 were negative in the present RT-PCR 
analysis, while the presence of a strong amplification band corre- 
sponded to a number of SSTR2 mRNA molecules larger than 10^7. 
No undefined bands were found in the present cases, and the tu- 
mors were recorded as positive or negative based on the presence 
or absence of a specific band, irrespective of its intensity. 

Table 1 Sequences of primers used for reverse transcriptase-polymerase chain reaction (PCR) 

Primer                                   Position    GenBank         Size of PCR    Annealing          References 
                                                     accession no.   product (bp)   temperature (°C) 

β2-Microglobulin 
  Sense: 5'acccccactgaaaaagatga3'        284-303     XM_007650       114            55                 [7] 
  Antisense: 5'atcttcaaacctccatgatg3'    378-397 
SSTR1 
  Sense: 5'gctacgtgctcatcattgcta3'       725-745     NM_001049       401            60                 [20] 
  Antisense: 5'ggactccaggttctcaggttg3'   1105-1125 
SSTR2 
  Sense: 5'cagtcatgagcatcgaccga3'        402-421     NM_001050       283            61                 [50] 
  Antisense: 5'gcaaagacagatgatggtga3'    665-684 
SSTR3 
  Sense: 5'tcatctgcctctgctacctg3'        662-681     NM_001051       221            57                 [20] 
  Antisense: 5'gagcccaaagaaggcaggct3'    863-882 
SSTR4 
  Sense: 5'atcttcgcagacaccagacc3'        547-576     NM_001052       321            60                 [20] 
  Antisense: 5'atcaaggctggtcacgacga3'    848-867 
SSTR5 
  Sense: 5'gccggcctctacttcttcgtg3'       847-866     NM_001053       153            60                 [20] 
  Antisense: 5'ccgtggcgtcagcgtccttgg3'   980-999 
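
As a cross-check of the PCR setup, the product sizes listed in Table 1 can be recomputed from the primer positions, and the time spent in the cycling steps can be estimated from the conditions given in the text. The following Python sketch is illustrative and not part of the original methods. It assumes that the listed positions are inclusive coordinates on the GenBank sequence, so that the product spans from the first base of the sense primer to the last base of the antisense primer. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PrimerPair:
    target: str
    sense_start: int     # first position of the sense primer (Table 1)
    antisense_end: int   # last position of the antisense primer (Table 1)
    product_bp: int      # product size as listed in Table 1
    cycles: int          # number of cycles as given in the text
    step_seconds: tuple  # (denaturation, annealing, extension) per cycle


# Values transcribed from Table 1 and the PCR conditions in the text.
PAIRS = [
    PrimerPair("beta2-microglobulin", 284, 397, 114, 35, (60, 60, 120)),
    PrimerPair("SSTR1", 725, 1125, 401, 40, (60, 30, 75)),
    PrimerPair("SSTR2", 402, 684, 283, 35, (60, 60, 120)),
    PrimerPair("SSTR3", 662, 882, 221, 35, (45, 60, 60)),
    PrimerPair("SSTR4", 547, 867, 321, 40, (60, 30, 75)),
    PrimerPair("SSTR5", 847, 999, 153, 40, (45, 60, 60)),
]


def expected_product_size(pair: PrimerPair) -> int:
    """Amplicon length implied by inclusive primer coordinates (an assumption)."""
    return pair.antisense_end - pair.sense_start + 1


def cycling_minutes(pair: PrimerPair) -> float:
    """Time spent in the three cycling steps only; ramp times, any initial
    denaturation and any final extension are not stated in the text and are
    therefore excluded."""
    return pair.cycles * sum(pair.step_seconds) / 60


for pair in PAIRS:
    size = expected_product_size(pair)
    status = "matches" if size == pair.product_bp else "differs from"
    print(f"{pair.target}: computed {size} bp {status} Table 1; "
          f"cycling steps take {cycling_minutes(pair):.1f} min")
```

With the positions as transcribed, the computed amplicon length agrees with the listed product size for all six primer pairs.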

Antibodies-SSTR2 

A polyclonal antibody anti-SSTR2 was produced and characterized 
by one of us (MS) [40, 41]. This antibody (coded K230) was raised 
in sheep and was specific for a sequence of 20 amino acids of the 
C-terminal portion of the SSTR2A (KSRLNETTETQRTLLNEDLQ, 
positions 347-366). Another antibody to SSTR2A, purchased from 
Biotrend/Gramsch Laboratories (Schwabhausen, Germany) was also 
used and found to have an overlapping reactivity pattern with the 
former SSTR2 antiserum. The antibody to SSTR3 was purchased 
from Biotrend/Gramsch Laboratories. The characterization of these 
latter antibodies was reported by Schultz et al [43]. The serum anti- 
SSTR5 (coded MK86) was produced in the laboratory of one of us 
(MS) against the human SSTR5 receptors (see below). 

Production and characterization of anti-SSTR5 antibodies 

Fig. 1 Western blot of anti-human somatostatin receptor (SSTR)5 
antibody (MK86) reactivity in transfected Chinese hamster ovary 
(CHO) cells expressing SSTR1-5 (columns 1-5). A specific band 
is present in SSTR5-transfected cells only 

The following peptide sequence was synthesized (amino acid single 
letter code): CGAKDADATEPRPDR (corresponding to a carboxy-terminal 
fragment of human SSTR5, positions 325-339). 
The peptide was coupled to keyhole limpet hemocyanin (KLH; 
Sigma, Poole, UK) and injected into one rabbit (MK86). Test 
bleeds were taken after each booster injection, and a terminal 
bleed was carried out. Serum was obtained from this bleed and 
used in IHC and Western blotting. The specificity of the novel 
SSTR5 antiserum was confirmed by means of Western-blot exper- 
iments on membranes obtained from Chinese hamster ovary 
(CHO) cells expressing the recombinant human SSTR types 1-5, 
carried out as described elsewhere [40]. Application of anti-human 
SSTR5 antiserum yielded a signal in CHO cells expressing the 
respective SSTR5 only (Fig. 1). In these experiments, a broad band 
with smeared appearance was observed with a molecular weight 
centered at approximately 55 kDa. No cross-reactivity was observed 
to any other cell line expressing recombinant SSTRs different 
from type 5 (Fig. 1). Pre-adsorption controls were carried out 
by pre-incubating the antibodies with 100 µM of the antigenic 
peptide for 16-18 h prior to immune reactions and resulted in the 
abolition of the signal in either Western blotting or 
immunohistochemical studies, using human pituitary gland 
formalin-fixed sections as positive controls (data not shown). 

Table 2 Summary of clinico-pathological features of 81 gastroenteropancreatic endocrine tumors 

                                            Gastrointestinal                  Pancreatic 
Total no. of cases                          28                                53 
Diagnosis (WHO 2000): 
  Well-differentiated endocrine tumor       6                                 37 
  Well-differentiated endocrine carcinoma   19                                14 
  Poorly differentiated endocrine carcinoma 3                                 2 
Hormonal status (no. of cases)              Gastrin (3)                       Insulin (25), glucagon (5), 
                                                                              somatostatin (4), gastrin (5), 
                                                                              pancreatic polypeptide (1), 
                                                                              vasoactive intestinal peptide (1), 
                                                                              non functioning (14) 
Sex male/female                             14/14                             25/28 
Age (median, years)                         60                                56 
Tumor size (mean, cm)                       2.8                               4.0 
Location                                    Stomach (6), ileum (13),          Head (22), Body-tail (31) 
                                            appendix (4), colon/rectum (5) 
Metastases (regional lymph node or liver)   16/28 (57.1%)                     17/53 (32%) 
Studied using reverse-transcriptase         13                                33 
  polymerase chain reaction and 
  immunohistochemistry 
Studied using immunohistochemistry only     15                                20 
  (Table 3) 



Immunohistochemistry 

Sections (4- to 5-µm-thick) from buffered formalin-fixed, 
paraffin-embedded tissue, adjacent to those used for conventional 
histopathological and immunohistochemical examination, were 
collected onto poly-L-lysine-coated slides. Primary antibodies to 
SSTR2A (commercial serum), SSTR3 and SSTR5 were applied to 
tissue sections with prior antigen retrieval (three 3-min passages in 
microwave oven at 750 W in 10 mM citrate buffer pH 6.0) for 
30 min at RT, at the dilutions of 1/20000, 1/40000 and 1/35000, 
respectively. The other anti-SSTR2A antiserum (K230) was applied 
overnight at 4°C at a dilution of 1/2000 with no prior antigen 
retrieval. The immune reactions were then revealed with the 
immunoperoxidase technique using the streptavidin-peroxidase kit 
(Dako, Glostrup, Denmark) and a further amplification with 
biotinylated tyramide (GenPoint, Dako, diluted 1/5) with the 
subsequent washing step performed in hot PBS (at 95°C) to reduce the 
background and increase the sensitivity, as described by Volante et 
al. [48]. Diaminobenzidine was used as chromogen, followed by a 
weak nuclear counterstain (or no counterstain in selected cases). 
Control stainings for all antibodies included immunoperoxidase 
reaction on serial sections using pre-absorbed serum or buffer 
instead of the primary antibody. All pancreatic endocrine tumors 
were immunostained in parallel sections for insulin (polyclonal, 
diluted 1/400, Dako), glucagon (polyclonal, diluted 1/5, BioGenex, 
San Ramon, Calif), somatostatin (polyclonal, diluted 1/3000, 
Serotec, Oxford, UK) and gastrin (polyclonal, diluted 1/50, 
BioGenex). 

Double immunostaining for each SSTR subtype and individual 
pancreatic hormones was performed in the two control pancreatic 
tissues and in selected tumors with peritumoral islets 
available for co-localization studies. A sequential 
immunohistochemical procedure under the same conditions described above 
was performed, finally revealing the receptor proteins with green 
fluorescence and the pancreatic hormones of alpha, beta or delta 
cells with red fluorescence. For the purpose of co-localization 
analysis of receptor types and single pancreatic hormones, 250 
cells (for each cell type) were counted in nearby islets and the 
relative percentage of cells co-localizing either SSTR type was 
recorded. 



In situ hybridization 

Selected tumors (12 cases, including 7 tumors with discrepant 
RT-PCR and IHC findings) were also analyzed for SSTR2 mRNA 
expression by means of a non-radioactive, tyramide deposition- 
based ISH technique. The detailed procedure was described else- 
where [24] and employed a 48-bp oligonucleotide corresponding 
to positions 91-139 of the SSTR2 gene [50]. Appropriate negative 
controls in all the cases investigated included hybridization of 
parallel sections using an unrelated probe specific for Epstein-Barr 
virus mRNA. The specificity of the hybridization signal was also 
confirmed in some cases using a sense probe, omitting the specific 
probe or pre-treating the sections with RNAse. 



Clinico-pathological correlation 

The results of SSTR analysis were compared with clinico-pathological 
data, including age, sex, tumor location and size, recurrence 
or metastases, tumor grade, and hormonal status. Statistical 
evaluation was performed by means of the ANOVA and chi-square 
(Yates corrected) tests. The data were also related to the 
outcome in those patients in which this information was available. 
This study was performed in accordance with the ethics standards 
laid down in the 1964 Declaration of Helsinki. 
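
The Yates-corrected chi-square test named above can be illustrated on a 2x2 table. What follows is a minimal sketch under stated assumptions, not a reconstruction of the analysis actually run: the function name `yates_chi_square` is hypothetical, and the example counts (metastases in 16 of 28 gastrointestinal versus 17 of 53 pancreatic tumors) are borrowed from Table 2 only for illustration; whether this particular comparison was tested in the study is not stated.

```python
import math


def yates_chi_square(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Chi-square test with Yates continuity correction for a 2x2 table.

    Table layout (hypothetical naming):
                 feature present   feature absent
        group 1        a                 b
        group 2        c                 d

    Returns (chi-square statistic, two-sided p-value for 1 degree of freedom).
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be non-zero")
    # Continuity correction: shrink |ad - bc| by n/2, never below zero.
    diff = max(abs(a * d - b * c) - n / 2.0, 0.0)
    chi2 = n * diff ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom the chi-square survival function
    # reduces to erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Illustrative counts from Table 2: metastatic vs non-metastatic cases.
chi2, p = yates_chi_square(16, 28 - 16, 17, 53 - 17)
print(f"chi-square = {chi2:.2f}, p = {p:.3f}")
```

The closed-form p-value is used so that the sketch needs only the standard library; a statistics package would normally be preferred for real data.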



Results 

Clinico-pathological data 

The relevant clinico-pathological data of 53 pancreatic 
and 28 gastrointestinal endocrine tumors are summarized 
in Table 2. The majority of cases were well-differentiated 
endocrine tumors (carcinoids) or well-differentiated 
endocrine carcinomas. Five cases of poorly differentiated 
endocrine carcinomas (with small or large cell features) 
were also included. The male to female ratio was 
approximately 1 in both groups of pancreatic and intestinal 
tumors, with the patients' median age being 56 years 
and 60 years in the two groups, respectively. The pancreatic 
and gastrointestinal tumors measured 0.4-20 cm 
(mean 4.0 cm) and 0.3-11 cm (mean 2.8 cm), respectively. 
Lymph-node and/or liver metastases were observed 
in 57.1% of intestinal tumors and in one-third 
(33.9%) of pancreatic tumors. 



SSTR in pancreatic endocrine tumors 

RT-PCR analysis, performed in 33 cases, showed mRNA 
amplification of SSTR1 in 30 of 33 (90.9%), of SSTR2 
in 28 of 33 (84.8%), of SSTR3 in 26 of 33 (78.8%), of 
SSTR4 in 8 of 33 (24.2%) and of SSTR5 in 14 of 33 
(42.4%) cases (Fig. 2a, Table 3). In all cases, the preservation 
of total mRNA was ensured by detectable amplification 
of β2-microglobulin mRNA. IHC was performed 
using anti-SSTR2, 3 and 5 antibodies. The immunohistochemical 
reactivity was markedly heterogeneous, being 
in some cases focally positive (in less than 25% of 
tumor cells, scored one plus, +; Table 3) and in others 
diffusely immunostained in over 50% of tumor cells 
(score +++). Some cases had an intermediate reactivity 
with between 25% and 50% of tumor cells positive 
(score ++). A cell membrane distribution of the immunoreactivity 
was present with anti-SSTR2 antibodies, 
whereas anti-SSTR3 and SSTR5 antibodies had a preferential 
membrane staining in most cases or a combined 
membrane and cytoplasmic reactivity in others (Fig. 3). 

Fig. 2 Reverse-transcriptase polymerase chain reaction (RT-PCR) 
for somatostatin receptor (SSTR) 1, 2, 3, 4 and 5 mRNA in 33 
pancreatic endocrine tumors (lanes 1-33) (a) and in 13 intestinal 
carcinoids (lanes 1-13) (b). The first column to the left represents 
the molecular weight marker. Columns labeled -C and +C stand 
for negative and positive controls, respectively (see text) 
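
The semi-quantitative immunohistochemical score described in the text (+ for fewer than 25% of tumor cells positive, ++ for 25-50%, +++ for more than 50%) can be written as a small lookup. This is only a sketch: the function name `ihc_score` is hypothetical, the "-" label for a case with no positive cells is an assumption, and the text does not say how values of exactly 25% or 50% were assigned, so placing both in the ++ class is also an assumption.

```python
def ihc_score(percent_positive: float) -> str:
    """Map the percentage of immunoreactive tumor cells to a score label.

    Thresholds follow the text: <25% is "+", 25-50% is "++", >50% is "+++".
    Assumptions: 0% is reported as "-", and the boundary values 25 and 50
    fall in the "++" class.
    """
    if not 0 <= percent_positive <= 100:
        raise ValueError("percentage must lie between 0 and 100")
    if percent_positive == 0:
        return "-"
    if percent_positive < 25:
        return "+"
    if percent_positive <= 50:
        return "++"
    return "+++"


# Example: score a few hypothetical cell counts out of 200 tumor cells.
for positive_cells in (0, 30, 80, 150):
    percent = 100.0 * positive_cells / 200
    print(positive_cells, "of 200 ->", ihc_score(percent))
```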

The majority of cases had a concordant mRNA ex- 
pression, as detected by RT-PCR analysis, and protein 
expression, as shown by IHC. Discrepant cases included 
two tumors that failed to stain with anti-SSTR2 anti- 
bodies, despite a positive RT-PCR. In addition, a focal 
immunoreactivity for SSTRs 2, 3 and 5 was observed in 
one, one and four cases, respectively, in the absence of 
the corresponding mRNA amplification. This is likely 
due to tumor heterogeneity, since most cases had a focal 
SSTR immunoreactivity and, although care was taken to 
freeze tumor fragments adjacent to those used for conventional 
histopathological analysis, it cannot be excluded 
that the tumor cell populations of the two samples had 
a divergent differentiation pattern. 

IHC was performed with anti-SSTR2, 3 and 5 anti- 
bodies in 22 additional tumors from 20 patients, in 
which frozen tissues were not available. Two patients 
(case nos. 35 and 39 - Table 3) had two separate tumor 
nodules producing a different hormone. For the purpose 
of correlative analysis between SSTR expression and 
hormone production, they were kept separate. SSTR2 
immunoreactivity was found in 15 of 22 tumors (68.2%), 
SSTR3 in 8 of 22 (36.4%) and SSTR5 in 14 of 22 
(63.6%). 

Taking together all data on SSTR expression, it appeared 
that the majority of pancreatic endocrine tumors 
expressed multiple SSTR subtypes, with at least 
three SSTR types present in 75% of cases and only 
four of 55 tumors completely negative. 

The SSTR immunoreactivity pattern was overlapping 
in lymph-node or liver metastases studied in parallel 
with the corresponding primary tumor. In addition, the 
pre-operative biopsy material (fine needle aspirate of a 
lymph-node or liver metastasis) of three cases gave overlapping 
results relative to the corresponding surgically 
resected tumor (Fig. 3). 

Fig. 3 a Case 32. Pancreatic gastrinoma diffusely positive (score 
+++) for somatostatin receptor (SSTR)2A at the cell membrane 
level. b Case 31. Pancreatic insulinoma showing a focal (score +) 
positivity for SSTR2A with predominant membranous pattern. 
c Case 31. Strong membrane staining for SSTR3 in most tumor 
cells (score +++). d Case 3. A weak cytoplasmic staining in single 
tumor cells with membrane reinforcement is present in a pancreatic 
insulinoma (score +). e Case 37. Diffuse cytoplasmic staining 
with membrane reinforcement in a pancreatic somatostatinoma using 
the novel MK86 antibody to SSTR5 (score +++). f Case 15. 
Heterogeneous SSTR5 staining in a non-functioning pancreatic tumor 
with both cytoplasmic and membranous pattern (score ++). 
(a-f x200). g-i Case 8. The pre-operative fine needle aspirate of 
this insulinoma metastatic to the liver shows a tumor cell cluster in 
the cell block (g) positive for chromogranin A (h) and for 
SSTR2A, which is localized at the cell membrane of most tumor 
cells (i) (g-i x400). l-n Case 14. Non-functioning pancreatic tumor. 
The same tumor area expresses SSTR2 at protein level (positive 
SSTR2A immunoperoxidase) (l) and also at mRNA level 
(positive ISH reaction) (m). No staining is present hybridizing a 
parallel section with an unrelated probe (n) (l-n x400). o Expression 
of SSTR in normal pancreatic islets. Relative percentages of 
SSTR2, SSTR3 and SSTR5 co-localization in normal pancreatic 
islet cells (left). An example of double immunofluorescence staining 
of somatostatin in delta cells (red) and SSTR5 (green) is 
shown to the right. All delta cells are positive for SSTR5 (yellow 
color), showing either cell membrane (arrows) or cytoplasmic 
(arrowheads) staining (x400) 

In six cases, ISH for SSTR2 mRNA was performed 
and showed a concordant pattern with respect to RT-PCR 
data. A diffuse cytoplasmic positivity was present in the 
positive cases (Fig. 3), but not in parallel sections stained 
with an unrelated or sense probe. 

When the SSTR expression was evaluated with regard 
to hormonal activity of the tumors, it turned out that 
SSTRs were heterogeneously distributed among different 
functioning or non-functioning endocrine tumors (Fig. 3). 
Generally, SSTR1 was diffusely expressed in endocrine 
tumors of the pancreas, as well as SSTR2, which was 
present in all glucagon (five cases) and gastrin-producing 
tumors (eight cases, including three located in the duodenum). 
SSTR3 and SSTR5 were expressed in approximately 
two-thirds of tumors, with no apparent restriction 
to a given hormonally active tumor, with the only exception 
of SSTR5 expressed in all four somatostatin-producing 
tumors. Finally, SSTR4 was poorly expressed in pancreatic 
endocrine tumors, the specific mRNA being present 
in only 8 of 33 tumors (24.2%). Interestingly, all such 
cases were insulinomas. Non-functioning tumors were 
not specifically associated with a specific SSTR subtype, 
irrespective of their size, tumor grade or biological behavior. 
Two cases, both high grade and metastatic tumors, 
were negative for all SSTR subtypes. 



Normal pancreatic islets 

The immunohistochemical expression of SSTRs 2, 3 and 
5 was controlled in peritumoral pancreatic islets and in 
two control cases of normal pancreas, using double 



stainings with two-color immunofluorescence. SSTR2 
was present in all cell types, co-localizing with 70% of 
alpha cells, 75% of beta cells and 55% of delta cells. 
SSTR3 co-localized with 70% of alpha cells, 60% of be- 
ta cells and less than one-third of delta cells. SSTR5 was 
present in 80% of alpha cells, 90% of beta cells, and in 
virtually all delta cells (95%; Fig. 3). 



SSTR in gastrointestinal endocrine tumors 

RT-PCR analysis, performed in 13 cases, showed mRNA 
amplification of SSTR1 in 12 of 13 (92.3%), of SSTR2 
in 13 of 13 (100%), of SSTR3 in 7 of 13 (53.8%), of 
SSTR4 in 2 of 13 (15.4%) and of SSTR5 in 10 of 13 
(76.9%) cases (Fig. 2b, Table 4). All the cases analyzed 
had a detectable level of the housekeeping gene β2-microglobulin 
mRNA, thus confirming the total mRNA 
quality. IHC was performed with anti-SSTR2, 3 and 5 
antibodies. The immunohistochemical reactivity was extremely 
heterogeneous, as for pancreatic tumors. A cell 
membrane immunoreactivity was present with anti-SSTR2 
antibodies, while in SSTR3- and SSTR5-stained 
cases, cell membrane and cytoplasmic reactivities were 
combined in some cases (Fig. 4). 
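
The RT-PCR positivity rates quoted for the two series can be recomputed, and pooled across all 46 cases studied by RT-PCR, with a few lines of arithmetic. The sketch below uses only the counts given in the text; the variable and function names are hypothetical, and pooling the two series is an illustrative step rather than a calculation reported by the authors.

```python
# RT-PCR positive cases per receptor subtype, as reported in the text.
PANCREATIC = {"SSTR1": 30, "SSTR2": 28, "SSTR3": 26, "SSTR4": 8, "SSTR5": 14}
GASTROINTESTINAL = {"SSTR1": 12, "SSTR2": 13, "SSTR3": 7, "SSTR4": 2, "SSTR5": 10}
N_PANCREATIC = 33
N_GASTROINTESTINAL = 13


def positivity_rates(counts: dict[str, int], total: int) -> dict[str, float]:
    """Percentage of positive cases per subtype, rounded to one decimal."""
    return {subtype: round(100.0 * n / total, 1) for subtype, n in counts.items()}


def pooled_rates() -> dict[str, float]:
    """Positivity per subtype over both series combined (46 cases)."""
    pooled = {s: PANCREATIC[s] + GASTROINTESTINAL[s] for s in PANCREATIC}
    return positivity_rates(pooled, N_PANCREATIC + N_GASTROINTESTINAL)


print("pancreatic:", positivity_rates(PANCREATIC, N_PANCREATIC))
print("gastrointestinal:", positivity_rates(GASTROINTESTINAL, N_GASTROINTESTINAL))
print("pooled:", pooled_rates())
```

Pooled in this way, SSTR4 mRNA is found in 10 of 46 cases, which is in line with the figure of about 20% quoted in the Discussion.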

The majority of cases had concordant mRNA and 
protein expression. Discrepant cases had a focal immu- 
noreactivity for SSTR2, 3 and 5 in one, two and one 
cases, respectively, despite a negative RT-PCR reaction. 

IHC alone was performed with anti-SSTR2, 3 and 5 
antibodies in 15 additional tumors, 8 of which had 
lymph-node and/or liver metastases. SSTR2 immunore- 
activity was found in 12 of 15 tumors (80%), SSTR3 in 9 
of 15 (60%) and SSTR5 in 12 of 15 (80%). In four cases 
of gastric body carcinoid associated with chronic atro- 
phic gastritis and hyperplasia of neuroendocrine ECL 
cells, small hyperplastic nests as well as single endocrine 
cells scattered in the gastric glands were also positive for 
SSTR2 immunostaining (Fig. 4). 

Taking together all data on SSTR expression, it ap- 
peared that the majority of gastrointestinal endocrine tu- 
mors were expressing multiple SSTR subtypes, with at 
least three SSTR types present in 90% of cases (with one 
tumor expressing all five types and three others having 
four of five SSTR types expressed) and with only one 
case completely negative (a poorly differentiated endo- 
crine carcinoma). 

As for the pancreatic tumors, the SSTR immunoreac- 
tivity pattern was overlapping in lymph-node or liver 
metastases relative to that of the primary tumor, as well 
as in pre-operative biopsy material (fine needle aspirate 
of a liver metastasis; Fig. 4) and in the corresponding 
surgically resected tumor. 

In six cases, ISH for SSTR2 mRNA was performed 
and showed a concordant pattern with respect to RT-PCR 
data. A diffuse cytoplasmic staining was present in the 
positive cases, but not in parallel sections stained with an 
unrelated or sense probe (Fig. 4). 



Clinico-pathological correlation 

No statistically significant correlation was observed be- 
tween SSTR expression (any type) and either sex, age, 
tumor size, tumor location or biological behavior. With 
regard to tumor grade, the rare poorly differentiated en- 
docrine carcinomas (five cases) appeared to express 
SSTR to a lower extent, although the figures are too 
small to allow a definite conclusion on this point. Com- 
paring the extent of SSTR expression with the clinical 
outcome in the 54 patients in which this information was 



Fig4a-c Case 8, Duodenal gastrinoma metastatic to a lymph ► 
node. Somatostatin receptor (SSTR)2 is diffusely expressed as 
detected by positive immuuoreaction (a, score +-H-) and positive 
in situ hybridization (ISH) for SSTR2 mRNA (b). No stain is pres- 
ent iu a parallel section hybridized with an unrelated probe (c). 
(a-c x200). d Case 9. Ileal endocrine tumor showing strong mem- 
brane SSTR3 immunoreactivity in most tumor cells (score +++) 
(x400). e Case 16. Ileal endocrine tumor with a predominant 
membranous pattern of staining with the SSTR5 antibody (score 
+++). A normal intestinal gland (top) is uegative (x400). f-g Case 
28. A gastric carcinoid associated with chronic atrophic gastritis 
and neuroendocrine ECL- cell hyperplasia (as shown by chromo- 
grauiu A immunostaining) (f) are both positive for SSTR2 inimu- 
uoreaction (g, arrows), (f-g x200); h-j Case 6. The pre-operative 
fine needle aspirate of an ileal carcinoid metastatic to the liver 
shows tumor cell clusters in the cell block (h) diffusely positive 
for SSTR2A (i) with the staining localized at the cell membrane 
(j)(h-i X200,jx400) 
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Fig 4 a-c Legend see page 469 
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available, it appeared that no specific association was 
present between SSTR content and biological behavior. 
Finally, comparing pancreatic and gastrointestinal endo- 
crine tumors, it can be noticed that the percentage of 
SSTR subtype expression in the two groups is relatively 
similar, with SSTRs 1 and 2 present in the vast majority 
of GEP tumors, SSTRs 3 and 5 expressed by approxi- 
mately two-thirds of cases and SSTR 4 only occasionally 
represented in these tumors (mostly insulinomas). 



Discussion 

IHC using specific antibodies is a reliable tool 
to reveal SSTR 

In 46 GEP endocrine tumors, SSTR 1-5 expression was 
found in a consistent percentage of cases with the excep- 
tion of SSTR4, which was poorly represented (20%). 
These findings are the result of RT-PCR and immunohisto- 
chemical analyses. The literature data, mostly obtained 
from small case series (with few exceptions) [12, 49] are 
largely heterogeneous both in terms of SSTR expression 
results and methods employed to demonstrate them. On 
the contrary, we analyzed SSTR types 1-5 in a relatively 
large series of tumors and compared two sensitive tech- 
niques - RT-PCR and IHC with tyramide amplification. 
With regard to the former, few studies have detected SSTR 
types 1-5 mRNA in GEP tumors. While most authors 
agree on a high (>60%) SSTR1 and 2 and moderate (50%) 
SSTR5 expression, conflicting results were observed for 
SSTRs 3 and 4, which were found in nearly half of the 
cases of endocrine pancreatic tumors by some authors [12, 
32], but very poorly represented according to others [49]. 
IHC, however, is the last entry in the list of techniques 
used to reveal SSTR, and takes advantage of the possibility 
of a topographical SSTR localization within tumor 
cells using highly specific antibodies. So far, only 
SSTR2A immunoreactivity has been thoroughly character- 
ized in GEP endocrine tumors, with other SSTR types so 
far having been detected by mRNA analysis alone. 

In the current study, SSTRs 2, 3 and 5 were studied 
by means of IHC with specific antibodies and were 
found to correlate with RT-PCR data in over 90% of 
cases, supporting the data already published on SSTR2A 
immunohistochemical detection [14, 33] and expanding 
them to SSTR types 3 and 5. The latter were detected 
using a commercial antibody to SSTR3 and a novel antibody 
to SSTR5 developed in our laboratory. Both 
SSTR3 and SSTR5 immunostainings showed a preferen- 
tial cell membrane subcellular distribution, as typically 
observed for SSTR2A [24, 33]. In some cases, however, 
an associated cytoplasmic reactivity was also found, as 
described for SSTR2A in tumors of other organs, too 
[38]. A similar pattern of reactivity for SSTRs 2, 3 and 5 
was also observed in biopsy or fine needle aspirate cytol- 
ogy specimens obtained pre-operatively from metastatic 
liver nodules or primary pancreatic tumors, relative to 
the corresponding surgical samples. 
This indicates that the SSTR profile can be investigated 
using IHC in the course of the diagnostic work-up of 
a GEP tumor directly on small pre-operative biopsy 
specimens. A negative result, however, might not definitely 
exclude the presence of a given SSTR type, since 
20-30% of operated cases had a focal immunoreactivity 
and the biopsy sample may not contain SSTR immunoreactive 
cells. In any case, a positive immunohistochemical 
SSTR detection may open new prospects for somatostatin 
analogue-based radio-imaging and radiotherapy 
of GEP endocrine tumors. 



Interpretation criteria of SSTR expression need 
to be defined 

Cases showing discrepant results between IHC and 
RT-PCR accounted for up to 10% of cases in each SSTR 
type (namely, four, three and five cases for SSTRs 2, 
3 and 5, respectively). These included two tumors posi- 
tive by RT-PCR alone that failed to show the correspond- 
ing SSTR immunoreactivity, a finding possibly due to 
poor antigen preservation after fixation or low amount of 
receptor protein (below the threshold of detectability of 
immunohistochemical procedure). The remaining cases 
were negative by RT-PCR and had a focal reactivity (less 
than 20%) in tumor cells. This unexpected finding is 
possibly related to tumor heterogeneity, although care 
had been taken to freeze a fragment adjacent to that used 
for conventional histopathological examination. Consid- 
ering weak or focally positive reactions, it remains to be 
ascertained whether the immunohistochemical evalua- 
tion (and probably also SSTR evaluation in general, with 
any method) may require a semi-quantitative procedure 
or a cut-off value, to better interpret borderline findings. 
An answer to this point will stem from clinical studies 
on large series of GEP tumors treated with somatostatin 
analogues. The available literature data [10, 13, 19, 22, 
26, 36] showed that a relatively good correlation exists 
between SSTR expression (evaluated by autoradiogra- 
phy or RT-PCR) and in vivo detection of functional re- 
ceptors by means of octreotide scintigraphy, although 
discrepant cases were also found [13]. Unfortunately, no 
cases of the present series underwent clinical evaluation 
with octreotide. 
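
The agreement between IHC and RT-PCR can be expressed per receptor type from the discrepant case counts given above (four, three and five cases for SSTRs 2, 3 and 5). The sketch below assumes that the denominator is the 46 tumors studied by both techniques (33 pancreatic and 13 gastrointestinal), which the text implies but does not state in this passage; the names used are hypothetical.

```python
# Discrepant cases (IHC and RT-PCR disagree) per receptor type, from the text.
DISCREPANT = {"SSTR2": 4, "SSTR3": 3, "SSTR5": 5}
# Assumption: tumors studied by both RT-PCR and IHC (33 pancreatic + 13 GI).
N_BOTH_METHODS = 33 + 13


def concordance_percent(discrepant: int, total: int) -> float:
    """Percentage of cases in which the two methods agree."""
    if not 0 <= discrepant <= total:
        raise ValueError("discrepant count must lie between 0 and total")
    return round(100.0 * (total - discrepant) / total, 1)


for subtype, n_discrepant in DISCREPANT.items():
    print(subtype, concordance_percent(n_discrepant, N_BOTH_METHODS), "%")
```

Under this assumption the agreement lies between roughly 89% and 93%, consistent with the statement that discrepant cases accounted for up to about 10% of cases for each receptor type.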



Correlative SSTR expression in pancreatic islet cells 
and related endocrine tumors 

Although not specifically designed as a study on normal 
pancreatic islets, we had the opportunity of mapping the 
individual SSTR types in peritumoral pancreatic islet 
cells, as well as in two normal pancreatic tissues used as 
controls. This was of special interest because of discrep- 
ant data on the immunohistochemical localization of 
SSTR in pancreatic endocrine cell types. Kumar et al. 
[18] found SSTRs 1 and 2 in alpha and beta cells and 
SSTR5 in beta and delta cells, with SSTRs 3 and 4 poorly 
represented in the pancreas. The data on SSTR2 are in 
agreement with the findings of Reubi et al. in human 
[34] and Mitra et al. [21] and Hunyadi et al. [11] in rat 
pancreas. Conversely, Portela-Gomes and co-workers 
[28] observed a wide occurrence of SSTR subtypes in all 
pancreatic endocrine cells (range 60-100% of individual 
cell types). In this study, considering specifically only a 
predominant cell membrane immunoreactivity, we found 
SSTR2 expressed in 70%, 75% and 55% of alpha, beta 
and delta cells, respectively; SSTR3 expressed in two- 
thirds of alpha and beta cells but in less than 30% of 
delta cells; and finally SSTR5 co-localized in virtually 
all delta cells and in 80-90% of alpha and beta cells. 
These figures are in agreement with those of Portela-Gomes 
et al. [28] and Kumar et al. [18] on SSTR5 in delta cells 
and Reubi et al. [34] and Strowski et al. [47] on SSTR2 
in alpha and beta cells. The results of Kumar et al. [18] 
on other SSTR types are slightly different, possibly due 
to the fact that these authors used antibodies to rat 
SSTRs. In particular, SSTR3 expression was very low in 
the experience of both Kumar et al. [17] and Kubota 
et al. [16], while both RT-PCR and IHC were strongly 
positive in our cases. 

With regard to SSTR expression in pancreatic neo- 
plasms, SSTR2 was widely expressed by gastrin- and 
glucagon-producing tumors (100% of cases), including 
three duodenal gastrinomas. In addition, all somatosta- 
tin-producing tumors were SSTR5 positive, a finding 
which parallels the profile of normal delta cells, as de- 
scribed above. Incidentally, three of four cases had a 
combined membrane and cytoplasmic positivity, in 
agreement with the data of Reubi et al. [38] on SSTR2A 
immunoreactivity in somatostatinomas. As far as insulin- 
producing tumors are concerned, SSTR2 was present in 
76% of tumors, thus confirming that these tumors are 
rich in SSTR2, just as normal beta cells are [14, 33, 34, 
49]. Irrespective of the hormonal product, most tumors 
expressed SSTR1 (as demonstrated by RT-PCR data 
only) and two-thirds of them were rich in SSTR3. This is 
at variance with Wulbrand's findings [49], who showed 
that SSTR3 was absent in the pancreatic tumors. This 
difference is surprising since in Wulbrand's [49] and in 
the present studies, the same primer sequences were used 
for RT-PCR. Apart from possible technical details (e.g., 
tissue preservation and storage), other factors related to 
individual tumors may account for these discrepancies. 
Only 20% of cases expressed SSTR4, all such cases 
being insulinomas. Finally, non-functioning tumors did 
not appear to be associated with a special SSTR expres- 
sion profile. The relatively large series of 14 cases here 
studied showed a heterogeneous pattern of SSTR expres- 
sion, as also observed in the seven non-functioning pan- 
creatic tumors of Wulbrand's series [49]. Nevertheless, 
the amount of receptors identified in all but two cases 
(both aggressive tumors) suggests that somatostatin ana- 
logue treatment may be a useful adjunct in these tumors, 
as also suggested by Bartsch [1]. 

A separate comment is deserved for the two cases 
with multiple pancreatic nodules producing different 
hormones. Interestingly, IHC (and ISH for SSTR2) could 
demonstrate a selective expression of SSTR subtypes in 
different nodules, with SSTRs 2 and 5 confined to the insulin- 
and somatostatin-producing nodules, respectively, 
in one case, and with SSTRs 2 and 3 present in the glucagon-producing 
tumor, but not in the insulin-positive 
nodule, of the second case. The clinical implications of 
this differential SSTR expression in combined endocrine 
tumors are not fully understood, but such a receptor profile 
might better address the therapeutic strategy with 
subtype-selective somatostatin analogues [6, 36, 37], in 
the case of inoperable disease or recurrent tumors. 

The observed correlation between the SSTR subtype(s) 
expression in normal islet cells and in their related 
hormone-producing tumors was also detected in 
normal/hyperplastic neuroendocrine ECL cells of the oxyntic 
gastric mucosa and their related carcinoid tumors. 

Possible association of reduced SSTR expression 
and tumor grade 

Taking into account clinical and pathological parameters 
of GEP endocrine tumors, we were unable to disclose 
any specific association between SSTR subtype expres- 
sion and either sex, age, tumor location and size, tumor 
stage or clinical outcome (in the 54 cases in which this 
information was obtained). In GEP adenocarcinomas, 
however, an inverse relationship between SSTR2 expres- 
sion and tumor stage was found [3]. With regard to tu- 
mor-grade, as observed in tumors of other locations (e.g., 
lung) [24], poorly differentiated endocrine carcinomas 
had an apparently reduced SSTR content. Three of five 
cases (60%) of poorly differentiated carcinomas had no 
or only one SSTR type focally expressed in their tumor 
cell population (as opposed to 12% of well-differentiated 
cases with only one or no SSTR type expressed). 



Conclusion 

a. SSTR expression in GEP endocrine tumors is hetero- 
geneous; SSTRs 1 and 2 are the types most common- 
ly detected (over 80%), followed by SSTRs 3 and 5 
present in two-thirds of cases and by SSTR4 which is 
poorly represented in GEP tumors. 

b. RT-PCR is a highly sensitive technique to reveal 
SSTRs but needs to be combined with methods that 
allow confirmation of the receptor protein localization 
in tumor cells. 

c. Immunohistochemical detection of SSTR2, and also 
of SSTRs 3 and 5, is a reliable procedure for the study 
of formalin-fixed tissues, including biopsy and cyto- 
logical pre-operative material. 

d. Hormonally active pancreatic tumors have a preferential 
SSTR5 expression in "somatostatinomas" and 
SSTR2 positivity in "gastrinomas" and "glucagonomas". 
"Insulinomas" express all SSTR types (ranging 
from 40% to 90% of cases). Non-functioning tumors 
express all SSTR types (but SSTR4) in 60-90% of 
cases. 

e. A correlation was observed between SSTR subtype 
expression in normal islet cells and neuroendocrine 
gastric cells and in the corresponding endocrine tu- 
mors. 

f. SSTR expression is not related to tumor stage or out- 
come but is related to tumor grade, with high-grade 
endocrine carcinomas having a reduced SSTR con- 
tent. 
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Abstract 

We studied whether induction of glucose 
transporters (GLUTs) 1 to 4 correlates with human 
papillomavirus (HPV)-dependent malignant 
transformation of cervical epithelium. Tissue samples of 
cervical intraepithelial neoplasia (CIN; grades 1 to 3), 
invasive carcinomas, and lymph node metastasis were 
examined. HPV typing was performed. Tissue sections 
were immunostained with GLUT1 to GLUT4 
antibodies. Messenger RNA (mRNA) in situ 
hybridization confirmed GLUT1 protein expression. 

Weak expression of GLUT1 was found in 
nondysplastic HPV-positive and HPV-negative 
epithelium; significant expression was observed in 
preneoplastic lesions, correlating with the degree of 
dysplasia. In CIN 3 high-risk HPV lesions, cervical 
cancer, and metastasis, GLUT1 was expressed at 
highest levels with a strong correlation of GLUT1 
mRNA and protein expression. Immunostains for 
GLUT2 to GLUT4 were negative. 

Cervical tumor cells respond to enhanced glucose 
utilization by up-regulation of GLUT1. The strong 
induction of GLUT1 mRNA and protein in HPV-positive 
CIN 3 lesions suggests GLUT1 overexpression as an 
early event in cervical neoplasia. GLUT1 is potentially 
relevant as a diagnostic tool and glucose metabolism as 
a therapeutic target in cervical cancer. 
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ft has been known for several decades that tumor cells 
show enhanced glucose metabolism compared with benign 
tissues. 1 A continuous supply of glucose is the predominant 
source of adenosine triphosphate generation and substrate 
storage in mammalian cells. Molecular cloning of the glucose 
transporter protein 1 (GLUT1) that catalyzes the uptake of 
glucose in erythrocytes 2 and the subsequent identification of 
homologous genes (GLUT2, GLUT3, GLUT4, and GLUT5) 
revealed a family of genes to facilitate diffusion of hexoses 
into mammalian cells. 3 On the basis of sequence similarities 
and functional characteristics, GLUTs can be divided into 3 
subfamilies, referred to as class I (GLUTs 1-4), class II 
(GLUTs 5, 7, 9, and 11), and class III (GLUTs 6, 8, 10, and 
12 and the myoinositol transporter HMIT1). 4 

Altered expression of glucose transporter proteins has 
been described in different tissues under various conditions 
such as transformation by oncogenes, hypoxia, and exposure 
to insulin. 5 In vivo, the storage of glycogen has been 
related to the differentiation status of squamous cell 
epithelia. 6,7 In glucose-deprived 3T3-L1 adipocytes, 
GLUT1 activation by glucosylation relates to increased 
storage of glycogen. 8 Cell culture experiments showed that 
the expression of GLUT1 transporter protein is induced by 
certain oncogenes such as ras and src 9,10 and regulated by 
growth factors such as platelet-derived growth factor 
and epidermal growth factor. 11,12 

Furthermore, members of the class I family of glucose 
transporter proteins were expressed strongly in various solid 
tumors such as breast cancer, 13,14 renal cell carcinomas, 15,16 
brain tumors, 17 and gastrointestinal malignomas. 18 So far, class 
II and III GLUT members have not been observed to be relevant 
for tumorigenesis or tumor progression. 19 
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Little is known about the expression pattern of GLUTs in 
cervical neoplasms. Increased glucose uptake demonstrated by 
positron emission tomography (PET) 20 suggests a potential 
pathophysiologic role of glucose metabolism in cervical 
cancer. The malignant progression of cervical carcinomas 
from preinvasive lesions may serve as a paradigm for tumorigenesis 
along well-defined pathologic stages and pathogenesis. 
The development of cervical neoplasia is associated with 
the human papillomavirus (HPV). Viral integration guarantees 
perpetual expression of HPV-related viral oncogenes. These 
molecular changes are produced most consistently by the 
prototypic high-risk HPV types (eg, 16, 18, 31, and 33), but 
many others (low-risk types, eg, 6, 11, and 40) may be associated 
with invasive cancer. 

We studied the relationship of HPV-related cervical tumor 
transformation, cellular glycogen storage and expression, and 
distribution of GLUT1, GLUT2, GLUT3, and GLUT4. 

Materials and Methods 

Tissue sections were prepared from formalin-fixed, 
paraffin-embedded archival specimens of cervical neoplasms 
obtained by surgery. Of 90 available tissue samples from cone 
biopsies performed for preneoplastic cervical lesions, 18 were 
mild cervical intraepithelial neoplasia (CIN 1), 24 were 
moderate (CIN 2), and 48 were severe dysplasia lesions (CIN 
3). The cervical carcinoma specimens and the cervical lymph 
node metastasis specimens were obtained from 94 patients 
who underwent radical surgical resections at the Department 
of Gynecology and Obstetrics, University Hospital, Aachen, 
Germany, between August 1995 and September 1997. Tissue 
specimens from 22 pelvic lymph node metastases were 
studied. The histologic diagnosis and grade of dysplasia were 
confirmed by 2 pathologists (R.B. and M.M.) according to the 
criteria of the International Federation of Gynecology and 
Obstetrics (FIGO classification) and the World Health 
Organization. 21 The tissue specimens were examined histologically 
and contained predominantly neoplastic cells, and the population 
of tumor cells was no less than 50% of all cells in the 
specimens. Paraffin-fixed tissue sections from 34 unselected 
benign cervical lesions were used as control samples and 
contained changes such as inflammatory or squamous atypia, 
immature metaplasia, or basal cell hyperplasia. 

For immunohistochemical analysis, 3-µm sections were 
deparaffinized and rehydrated. After rinsing with 
phosphate-buffered saline, endogenous peroxidase activity was 
inhibited by incubation in 3% hydrogen peroxide solution for 30 
minutes, and the slides were incubated in a 1:10 solution of 
equine normal serum to block nonspecific staining. 

After antigen retrieval by microwave treatment (600 W) 
for 5 minutes (3 times) in a 10-mmol/L concentration of 
sodium citrate, slides were incubated for 60 minutes with 
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polyclonal rabbit antihumaa GLUTI (DAKO, Hamburg, 
Germany) and polyclonal rabbit GLUT2, GLUT3, and 
GLUT4 antibodies (Biogenesis, New Field, England). The 
antibodies were diluted with bovine serum albumin 
according to the manufacturer's instructions. As a negative 
control, slides were reacted with equal amounts of nonspe- 
cific IgG instead of the specific primary antibody. After 
rinsing in phosphate-buffered saline, slides were incubated 
with a 1:50 dilution of biotinylated secondary antibody 
(DAKO) for 30 minutes and developed using the ABC technique 
(Vectastain ABC Systems, Vector Laboratories, 
Peterborough, England) according to the manufacturer's 
instructions. Immunocomplexes were visualized with the 
chromogen diaminobenzidine. Staining for glycogen was 
performed by the periodic acid-Schiff (PAS) reaction and 
counterstaining with toluidine blue. 

The specificity of GLUT1 staining was confirmed 
routinely in all cases by staining of parallel tissue sections 
with GLUT1 antiserum that had been preincubated with the 
immunizing peptide (25 µg/mL for 1 hour at 25°C). GLUT1 
immunostaining was blocked by this peptide competition in 
all cases. The sections were evaluated semiquantitatively for 
immunoreactivity and PAS staining, based on a scoring 
system that was defined by the product of the intensity of the 
staining and the percentage of the stained cells. The specimens 
were subdivided into strong (50% or more marked 
cells), moderate (25%-49%), weak (10%-24%), and not 
marked for GLUT1 and glycogen (<10%). Grading was 
performed by consensus of 2 observers (R.B. and M.M.) in a 
blinded manner without previous knowledge of the 
histopathologic and clinical data for each case. 
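
The category cut-offs described above can be sketched in code. This is only a minimal illustration, assuming the percentage of marked cells is the sole input; the function name staining_category is hypothetical and does not come from the study, and the intensity component of the score is not modeled because the text gives no numeric scale for it.

```python
def staining_category(percent_marked: float) -> str:
    """Map the percentage of marked cells to a semiquantitative category.

    Cut-offs follow the Methods text: strong (50% or more), moderate
    (25%-49%), weak (10%-24%), not marked (<10%). Hypothetical helper,
    written for illustration only.
    """
    if not 0 <= percent_marked <= 100:
        raise ValueError("percent_marked must be between 0 and 100")
    if percent_marked >= 50:
        return "strong"
    if percent_marked >= 25:
        return "moderate"
    if percent_marked >= 10:
        return "weak"
    return "not marked"


if __name__ == "__main__":
    # Example values chosen arbitrarily to show each category once.
    for value in (5, 15, 30, 80):
        print(value, staining_category(value))
```

Non-integer percentages between the published bands (for example 24.5%) fall into the lower category in this sketch; the source does not say how such cases were handled.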

In situ hybridization was performed on paraffin-embedded 
tissue samples from different cervical biopsies as 
described previously 22 using phosphorus 33 (33P)-labeled 
sense and antisense complementary RNA riboprobes for rat 
GLUT1. Briefly, Proteinase K-pretreated slides (10 µg/mL) 
were acetylated in acetic anhydride diluted 1:400 in a 
0.1-mol/L concentration of triethanolamine (pH 8.0) and 
hybridized overnight in 50% formamide, 10% dextran 
sulfate, a 10-mmol/L concentration of tris(hydroxymethyl) 
aminomethane (pH 8.0), a 10-mmol/L concentration of 
sodium phosphate (pH 7.0), 2x standard saline citrate, a 
5-mmol/L concentration of EDTA (pH 8.0), 150-µg/mL 
concentration of transfer RNA, a 10-mmol/L concentration 
of dithiothreitol, and a 10-mmol/L concentration of 
β-mercaptoethanol supplemented with 5 x 10^4 cpm/µL of 
33P-labeled sense or antisense riboprobes at 50°C. Finally, slides 
were washed twice in 50% formamide-2x standard saline 
citrate, 20-mmol/L concentration of β-mercaptoethanol, 
digested with RNase A (20 µg/mL) for 30 minutes at 37°C, 
and washed again 3 times with the same washing buffer for 
30 minutes each at 50°C. After dehydrating, slides were 

© American Society for Clinical Pathology 




Anatomic Pathology / Original Article 







coated with Kodak NTB2 emulsion (Kodak, Rochester, NY) 
and exposed for 8 to 10 days. The intensity of the silver 
staining was examined by darkfield microscopy, and digitalized 
images were examined using ImageJ 1.27z software 
(National Institutes of Health, Bethesda, MD). The Student t 
test was used for statistical analysis. 

For detection and typing of HPV, tissue sections were 
digested by Proteinase K, and DNA was extracted using the 
DNA Mini Kit (Qiagen, Hilden, Germany) according to the 
manufacturer's instructions. To detect HPV DNA, a 2-tiered 
polymerase chain reaction (PCR)-direct sequencing method 
was performed according to Feoli-Fonseca et al 23 with slight 
modifications. We used the general consensus primers 
GP5+/GP6+ and the MY09/MY11 primers for amplification of 
HPV DNA. Forty cycles of amplification were run with an 
initial denaturation at 95°C for 5 minutes and for 30 seconds in 
each cycle. Temperatures of 37°C and 58°C were set for 30 
seconds for annealing of the primer pairs GP5+/GP6+ and 
MY09/MY11, respectively. An extension step was done at 
72°C for 5 minutes. The integrity of human genomic DNA was 
verified by PCR amplification of the β-globin gene. This reaction 
served as a positive control. The amplification products of 
the 2 consensus primer pairs and the β-globin PCR were run on 
a 2% agarose gel and stained with ethidium bromide. 

PCR products were purified using the High Pure PCR 
product purification kit (Roche Diagnostics, Mannheim, 
Germany) according to the manufacturer's instructions. The 
sequence of 1 strand of the purified PCR fragments was determined 
with the Big-Dye Terminator sequencing kit (Applied 
Biosystems, Foster City, CA) using 3 to 5 pmol of GP5+ or 
MY09 as the sequencing primers. The results of the 
sequencing reactions were analyzed on an ABI Prism 310 
automated sequencer (Applied Biosystems). The obtained 
sequences were compared with documented virus sequences 
available in the GenBank databank using the BLAST program 
(Blast, Pittsboro, NC). The high-risk group comprised HPV 
types 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, and 68, 
and the low-risk group included HPV types 6, 11, 40, 42, 43, 
and 44. The Student t test was used for statistical analysis. 
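
The grouping of HPV types given above lends itself to a simple lookup. The following is a minimal sketch, assuming the sequenced type is available as an integer; the names HIGH_RISK_TYPES, LOW_RISK_TYPES, and hpv_risk_group are hypothetical, and the "unclassified" and "negative" labels are assumptions added here for types or results that the text does not assign to either group.

```python
from typing import Optional

# Type lists copied from the Methods text.
HIGH_RISK_TYPES = frozenset({16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 68})
LOW_RISK_TYPES = frozenset({6, 11, 40, 42, 43, 44})


def hpv_risk_group(hpv_type: Optional[int]) -> str:
    """Return the risk group for a sequenced HPV type.

    None stands for a specimen in which no HPV DNA was detected
    (an assumption of this sketch, not a convention from the study).
    """
    if hpv_type is None:
        return "negative"
    if hpv_type in HIGH_RISK_TYPES:
        return "high risk"
    if hpv_type in LOW_RISK_TYPES:
        return "low risk"
    # The text does not say how types outside both lists were treated.
    return "unclassified"


if __name__ == "__main__":
    for t in (16, 6, 70, None):
        print(t, hpv_risk_group(t))
```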

Results 

GLUT1-expressing epithelial cells were observed in 
squamous epithelial cells of normal and dysplastic cervical 
tissue probes. However, GLUT1 expression in normal 
epithelial cells was limited to the basal cell layers (Image 
1A). Staining intensity was weak and varied in the basal 
compartment from a focal to homogeneous pattern distribution. 
All 34 specimens obtained from benign cervical tissue 
were classified as weak or not marked. Compared with 
GLUT1 expression in the proliferative cell compartment in 
the basal cell layer, changes such as inflammatory atypia, 
metaplasia, and hyperplasia showed similar GLUT1 expression 
patterns and also were classified as weakly marked. 

The intensity of GLUT1 expression was unaltered from 
the HPV status. Of 14 HPV-positive specimens, 6 showed 
high-risk positivity and 8 showed low-risk positivity (Table 1). 

Erythrocytes and vascular endothelium served as 
internal positive controls and demonstrated strong GLUT1 
positivity. Simultaneous PAS staining of the epithelium 
revealed the glycogen content to be correlated inversely to 
the expression of GLUT1 (Figure 1). The glycogen staining 
was limited predominantly to the upper cell compartments, 
which contain the functional epithelium. In good agreement 
with these immunostaining results, in situ hybridization 
showed GLUT1 messenger RNA (mRNA) expression to be 
present specifically in the basal epithelium (Image 1B). 

In mildly dysplastic epithelium (n = 18), weak GLUT1 
expression was limited to the basal cell layers (Image 1C and 
Image 1D), and in moderate dysplasia (n = 24), enhanced 
expression was extended to the basal and parabasal compartments. 
The GLUT1 expression pattern was independent of 
HPV status. In CIN 1 lesions, 4 cases were in the high-risk 
HPV group, 6 were in the low-risk HPV group, and 8 were 
HPV-negative cases. Compared with benign cervical changes, 
significantly increased GLUT1 expression was observed in 
CIN 1 lesions (Image 2). 

Strong GLUT1 mRNA and protein expression was 
observed in all 48 CIN 3 lesions of the cervix uteri (Image 
1E and Image 1F). GLUT1 expression was continuously 
strong throughout the entire dysplastic epithelium, including 
the basal, suprabasal, and apical parts of the cervical epithelium. 
Of 48 CIN 3 lesions, 46 were HPV positive; 34 cases 
belong to the high-risk group and 12 to the low-risk group. 
No differences between the HPV type or status and GLUT1 
expression were observed. Only weak glycogen storage was 
observed in CIN 3 lesional tissue samples. 

Tumors with moderate dysplasia showed a high 
percentage of HPV positivity (high-risk HPV, 16 cases; 
low-risk HPV, 3 cases; HPV type not assignable, 2 cases; 
HPV-negative, 3 cases). The intensity of GLUT1 protein expression 
depended on the extent of dysplasia and was 
accompanied by inversely correlated glycogen content in the 
nondysplastic areas of the epithelium (Figure 1). 

GLUT1 overexpression was demonstrated in all 94 
cases of invasive cervical carcinoma and all 22 cervical 
metastases (Image 1G and Image 1H). The intensity of 
GLUT1 protein expression did not vary significantly 
between different tumor stages, the degree of tumor 
differentiation, or HPV status. A high-risk HPV type was detectable 
in 84 cases and a low-risk type in 8 cases. In 2 specimens, no 
HPV was detectable. 

GLUT1 expression was observed predominantly at the 
tumor periphery compared with the center of well-differentiated 
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llraagc II Glucose transporter (GLUT) 1 immunoreactivity with simultaneous periodic acid-Schiff (PAS) staining and GLUT1 
messenger RNA (mRNA) silver staining in normal cervical epithelium (A, GLUT1 and PAS, x200; B, GLUT1 mRNA and silver 
staining, x200), cervical intraepithelial neoplasia (CIN) 1 lesions (C, GLUT1 and PAS, x200; D, GLUT1 mRNA and silver staining, 



tumor areas. By PAS staining, we found an accumulation of 
glycogen in well-differentiated tumor areas, which was 
correlated inversely with GLUT1 expression. A similar 
distribution pattern was found in the lymph node metastases. 
Figure 2 illustrates the results of GLUT1 mRNA in situ 
hybridization. The intensity of the specific silver staining 
correlated significantly with the extent of cervical dysplasia 
and showed similar levels in CIN 3 lesions and cervical 
carcinoma. Overexpression of mRNA was limited to 
dysplastic cells and matched the patterns of GLUT1 protein 
distribution revealed by immunohistochemical analysis. 

Immunostains for GLUT2, GLUT3, and GLUT4 were 
negative in epithelial cells of both benign and dysplastic 
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lesions- Only inflammatory cells reacted with the antibodies 
and served as an internal positive control. In the advanced 
and poorly differentiated tumors, there was a pattern of 
expression similar to that in the dysplastic precursor lesions. 

Discussion 

Based on numerous molecular and epidemiologic 
studies, it has been demonstrated that specific types of HPV 
(high-risk HPVs, but also to a lower extent low-risk HPVs) 
can be detected in a transcriptionally active form in about 
95% of cervical cancer biopsy specimens. 24 Although the 
high-risk HPV-encoded oncogenes E6 and E7 induce 
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x200), CIN 3 lesions (E, GLUT1 and PAS, x200; F, GLlfTl mRNA and silver staining, x200), and cervical cancer <G, GLUT! and 
PAS, x200; H, GLUT1 mRNA and silver staining, x200). Expression of GLUT1 mRNA and protein in normal epithelial cells was 
limited to the basal proliferative cell layers, whereas PAS staining was restricted to the upper cell compartments. In the CIN 3 
lesions and cervical carcinoma, strong expression of GLUT1 mRNA and protein was observed. 




HPV Status, GLUT1 Expression, and PAS Staining in the Study Group* 

                   Benign Cervical   CIN 1         CIN 2         CIN 3         Cervical Cancer 
                   Tissue (n = 34)   (n = 18)      (n = 24)      (n = 48)      (n = 94) 
HPV positive       14 (41)           10 (56)       21 (88)†      46 (96)       92 (98) 
  High risk        6 (18)            4 (22)        16 (67)       34 (71)       84 (89) 
  Low risk         8 (24)            6 (33)        3 (13)        12 (25)       8 (9) 
GLUT1 expression   [illegible]       +             [illegible]   +++           +++ 
PAS staining       +++               +++           [illegible]   +             [illegible] 

CIN, cervical intraepithelial neoplasia; GLUT, glucose transporter; HPV, human papillomavirus; PAS, periodic acid-Schiff; -, negative; +, weak; ++, moderate; +++, strong. 

* Data for HPV are given as number (percentage). High-risk HPV infection was related closely to the degree of dysplasia and coincided with increasing GLUT1 expression. 

† In 2 cases (8%), the HPV type was not assignable. 
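
The percentages reported for HPV positivity can be rechecked from the counts and group sizes. The sketch below is an illustration only: the names GROUP_SIZES, HPV_POSITIVE, and percent_half_up are hypothetical, the CIN 2 count of 21 is taken as the sum of the 16 high-risk, 3 low-risk, and 2 not-assignable cases given in the Results, and half-up rounding is an assumption about how the published whole-number percentages were obtained.

```python
import math

# Group sizes and HPV-positive counts as stated in the Results text.
GROUP_SIZES = {"Benign": 34, "CIN 1": 18, "CIN 2": 24, "CIN 3": 48, "Cervical cancer": 94}
HPV_POSITIVE = {"Benign": 14, "CIN 1": 10, "CIN 2": 21, "CIN 3": 46, "Cervical cancer": 92}


def percent_half_up(count: int, total: int) -> int:
    """Percentage rounded to a whole number, with halves rounded up.

    Python's built-in round() rounds halves to the nearest even number,
    so floor(x + 0.5) is used instead (assumed rounding convention).
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return math.floor(100 * count / total + 0.5)


percentages = {
    group: percent_half_up(HPV_POSITIVE[group], GROUP_SIZES[group])
    for group in GROUP_SIZES
}

if __name__ == "__main__":
    for group, pct in percentages.items():
        print(f"{group}: {HPV_POSITIVE[group]}/{GROUP_SIZES[group]} = {pct}%")
```

Under these assumptions, 21 of 24 CIN 2 lesions corresponds to 88%, and 3 of 24 low-risk cases corresponds to 13%.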
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IFignre II Glucose transporter (GLUT) 1 protein expression 
(black bars) and cellular glycogen content (gray bars) in 
benign and dysplastic cervical epithelium. The immunoreactivity 
of GLUT1 was associated strongly with the degree 
of dysplasia and correlated inversely with the cellular 
glycogen content. However, cervical intraepithelial neoplasia 
(CIN) 3 and cervical cancer showed equivalent levels of 
strong GLUT1 overexpression. * P < .01. † P < .001. ‡ In the 
center of well-differentiated cervical cancer, mild periodic 
acid-Schiff-positive staining was observed. 



cellular immortalization, 25 their expression alone is not sufficient 
to cause malignant transformation. 26 Additional events 
must accumulate to convert a cell toward malignancy and to 
induce cervical cancer. It has been shown that the course of 
malignant disease not only is the result of failing intracellular 
and immunologic surveillance mechanisms 27 but also is 
determined by efficient blood vessel formation to support the 
outgrowing tumor with nutrients, hormones, and oxygen. 28 
Many tumorigenic cells are characterized not only by 
increased mitotic rates but also by high catabolic utilization 
of glucose and an increased number of specific glucose 
transporters. 29 A sufficient intracellular glucose supply 
mediated by glucose transporters seems to be a prerequisite 
for enhanced HPV transcription and the expression of the E6 
and E7 oncoproteins. In cell culture experiments using 
cervical cancer cells, specific suppression of the HPV 18 
transcription was found by adding the glycolytic pathway 
inhibitor 2-deoxyglucose. 30 

Our results demonstrate that human cervical carcinomas 
and their metastases selectively overexpress GLUT1. In benign 
cervical epithelium, moderate GLUT1 expression was restricted 
to the proliferating basal cell layers of the mucosa and changes 
such as inflammatory or squamous atypia, immature metaplasia, 
or basal cell hyperplasia. These findings are in line with 
the differentiation-related expression pattern of GLUT1 in 
normal human epidermis, in which GLUT1 is expressed in the 
basal layer and, to a lesser extent, in the immediate suprabasal 
layer of the epidermis. 31 GLUT1 mRNA expression precisely 
coincided with protein overexpression in normal epithelium, 
cervical dysplasia, and invasive cancers. These findings strongly 
suggest that the increased GLUT1 expression in cervical 
neoplasms is mediated by transcriptional mechanisms. 

In vitro studies on fibroblasts transfected with ras and src 
oncogenes revealed elevated levels of GLUT1 expression and 
an increase of transmembranous glucose transport. 10 These 
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Ilniage 21 Glucose transporter (GLUT) 1 protein immunoreactrvity in cervical intraepithelial neoplasia 1 dysplasia (A, x200) v's 
reparative cervical changes (B, x200). The staining intensity of GLUT1 in low-grade dysplasia was increased markedly 
compared with the benign specimen. B, The arrow shows GLUT1 staining of erythrocytes, which serve as an internal control 
for GLUT1 positivity. 
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results support our present findings indicating that GLUTl over- 
expression parallels malignant transformation of cervical epithelial 
cells. However, the regulatory mechanisms of GLUT1 
induction are, so far, poorly understood. Tumor hypoxia might 
be a key step for up-regulation of GLUT1 transcripts. 32 

However, we have observed overexpression in preneoplastic 
lesions. This might suggest direct oncogene-triggered 
mechanisms as potential factors for GLUT1 up-regulation. 
GLUT1 expression was not correlated with the HPV status 
or type in either benign or dysplastic cervical epithelium. 
High-risk HPV types were detectable predominantly in CIN 
2 and 3 lesions and invasive cervical cancer and had good 
correlation with the dysplastic stage. All high-risk 
HPV-infected CIN 2 and 3 lesions and carcinoma showed 
enhanced GLUT1 expression. There is certain evidence that 
these cases are associated with virus transcription and the 
expression of E6 and E7 oncogenes. 24 An enhanced cellular 
glucose supply mediated by an increasing number of GLUT1 
transporters could be a requirement for successful 
HPV-dependent malignant transformation. 

Our results indicate that the clinical observation of an 
increased glucose uptake of cervical cancer cells was related 
to an exclusive transmembranous overexpression of GLUT1. 
These data strongly suggest that the observed relationship 
between increased GLUT1 expression and the intracellular 
accumulation of 2-fluoro-deoxy-D-glucose (FDG) in tumor 
cells represents the biologic basis for diagnosis using 
fluorine 18-labeled FDG PET. 20,33 Enhanced expression of 
GLUT1 was detected already in moderate cervical dysplasia 
and much more in carcinoma in situ. In these preneoplastic 
lesions, the staining pattern of GLUT1 expression was 
stronger than in normal or mildly dysplastic mucosa and 
extended into the superficial cell layers depending on the 
degree of dysplasia. The observation that the preneoplastic 
lesions with increased GLUT1 expression were not detected 
by PET analysis may be explained by the small dimension of 
the epithelial dysplasia and the limited resolution of PET. 

We also analyzed the cellular glycogen content to determine 
whether alterations in glucose uptake are linked to 
tumor-associated changes in the storage of this polysaccharide. 
The glycogen content was of special interest because 
the presence of glycogen is known to be related to cellular 
maturation of squamous epithelium and disappears with loss 
of differentiation during neoplastic transformation. 6 In the 
present study, glycogen was found only in nondividing cells 
of the superficial layers of the benign cervical epithelium, 
whereas the basal layers did not contain glycogen (Image 
1A). The glycogen content in normal and preneoplastic 
lesions generally correlated inversely with GLUT1 expression, 
exhibiting intense glycogen storage in normal and 
reduced or absent storage in dysplastic epithelium. Thus, 
reduction of glycogen content was related closely to the 



[Figure 2 axis categories: Benign Epithelium, CIN 1, CIN 2, CIN 3, Cervical Cancer] 

Figure 2 Glucose transporter (GLUT) 1 messenger RNA 
(mRNA) expression in the proliferative compartment of the 
benign cervical epithelium (hatched bar) and in dysplastic 
epithelium (black bars). The gray bars represent the relative 
intensity in the functional cell compartments. GLUT1 mRNA 
expression in cervical epithelium correlated significantly with 
the degree of dysplasia. * P < .01. † P < .001. CIN, cervical 
intraepithelial neoplasia. 



degree of dysplasia and was associated strongly with 
increased GLUT1 expression. 

The observed storage pattern of glycogen was correlated 
inversely with the extension of the proliferation compartment 
and is in line with previous observations showing an association 
between glycogen content and tumor cell development, as well 
as differentiation in squamous cell carcinoma tumorigenesis. 34 
This cellular feature reflects the growing and metastasizing 
potency of cervical carcinomas, as increased utilization of 
glucose is a necessary condition of growing tumors and may be 
the pathophysiologic basis for induction of glucose transporter 
protein expression in the cell membrane. The lack of expression 
in benign cervical tissue and in the well-differentiated center of 
some cervical tumors gives rise to the assumption that GLUT1 
expression might be regarded as a characteristic feature of 
tumor cell clones with an increased energy requirement. 

In contrast with data from colon 35 and non-small cell 
pulmonary carcinoma, 36 no correlation between GLUT1 
expression and prognostic clinicopathologic features was 
found in the present study. This difference can be explained 
easily, since significant overexpression of GLUT1 is 
observed in premalignant cervical lesions and well-differentiated 
carcinomas. We, therefore, believe that GLUT1 overexpression 
is an early event in the development of cervical 
carcinomas. Our findings also may have substantial implications 
for the potential therapeutic application of cytotoxic 
agents conjugated to glucose in the treatment of premalignant 
or malignant cervical lesions. Further studies are 
required to explore whether this strategy will lead to more 
specific drug delivery to the neoplastic cell compartment. 
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Abstract 

Deoxycytidine kinase (dCK) is required for the phosphorylation of several deoxyribonucleoside analogues that are widely 
employed as chemotherapeutic agents. Examples include cytosine arabinoside (Ara-C) and 2-chlorodeoxyadenosine (CdA) in the 
treatment of acute myeloid leukaemia (AML) and gemcitabine to treat solid tumours. In this study, expression of dCK mRNA was 
measured by a competitive template reverse transcriptase polymerase chain reaction (CT RT-PCR) in seven cell lines of different 
histological origin, 16 childhood and adult AML samples, 10 human liver samples and 11 human liver metastases of colorectal 
cancer origin. The enzyme activity and protein expression levels of dCK in the cell lines were closely related to the mRNA 
expression levels (r = 0.75, P = 0.026 and r = 0.86, P = 0.007). In AML samples, dCK mRNA expression ranged from 1.16 to 35.25 
(x 10^-3 x dCK/β-actin). In the cell line panel, the range was 2.97-56.9 (x 10^-3 x dCK/β-actin) of dCK mRNA expression. The enzyme 
activity in liver metastases was correlated to dCK mRNA expression (r = 0.497, P = 0.05). In the liver samples, these were not 
correlated. dCK mRNA expression showed only a 36-fold range in liver while a 150-fold range was observed in the liver metastases. In 
addition, dCK activity and mean mRNA levels were 2.5-fold higher in the metastases than in the liver samples. Since dCK is 
associated with the sensitivity to deoxynucleoside analogues and because of the good correlation between the different dCK 
measurements in malignant cells and tumours, the CT RT-PCR assay will be useful in the selection of patients that can be treated with 
deoxycytidine analogues. 
© 2003 Published by Elsevier Science Ltd. 

Keywords: Deoxycytidine kinase; Solid tumours; AML; Gemcitabine; Ara-C 



1. Introduction 

Deoxycytidine kinase (dCK) (EC 2.7.1.74) is a pyr- 
imidine salvage enzyme that phosphorylates deox- 
ycytidine, deoxyadenosine and deoxyguanosine. 
Furthermore, it is responsible for the phosphorylation 
of several deoxynucleoside analogues, which are 
widely used as anticancer and antiviral agents, such as 
1-β-D-arabinosyl cytosine (cytarabine, Ara-C), 
2',2'-difluorodeoxycytidine (gemcitabine, dFdC) and 
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2-cklorodeoxyadenosine (cladribine, CdA). The activity 
of dCK shows a wide variation in normal and malignant 
cells and tissues [1,2]. 

The level of dCK activity has been reported to be 
closely related to sensitivity to deoxynucleoside analogues 
such as CdA [3], Ara-C and dFdC [4-9]. A deficiency 
of dCK leads to a resistance to deoxycytidine and 
deoxyadenosine analogues [10,11]; therefore, pretreatment 
measurements of dCK might be of predictive 
value [5]. However, relatively large cell numbers or 
tumour biopsy specimens are required for measurements 
of activity. The use of the polymerase chain 
reaction (PCR) enables us to use small biopsy specimens 
or a small number of cells. Therefore, we applied reverse 
transcriptase (RT)-PCR techniques to develop a sensitive 
assay for the expression of dCK mRNA. In order to 
quantify the expression, we used competitive templates 
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(CTs) for dCK mRNA which were included in the same 
reaction mixture. 

In this study, dCK mRNA expression was compared 
with dCK activity and protein expression in a panel of 
solid tumour and leukaemia cell lines, as well as in 
samples from adult and paediatric leukaemia patients, 
and from liver and liver metastases from patients with 
colorectal cancer. 



2. Materials and methods 

2.1. Materials 

[8-3H]-2-Chloro-2'-deoxyadenosine (24.2 Ci/mmol) 
was obtained from Moravek, Brea, CA, USA. RNAzol 
was purchased from Campro Scientific (Veenendaal, 
The Netherlands); Moloney Murine Leukemia Virus 
Reverse Transcriptase (M-MLV-RT) and RNase inhibitor 
(25 IU/µl) were obtained from Promega (Madison, 
WI, USA). Deoxyribonucleotides (dNTPs), random 
hexamers, and Taq polymerase (5 IU/µl) were purchased 
from Amersham/Pharmacia Biotech (Roosendaal, 
The Netherlands). Primers were from Isogen 
(Maarssen, The Netherlands). RESponse Research 
agarose was obtained from Biozym (Landgraaf, The 
Netherlands). The rabbit anti-human dCK polyclonal 
antibody was produced and tested at the Institute of 
Molecular Biology and Biotechnology at Crete by the 
group of Dr I. Talianidis [12]. 

2.2. Cells

The cancer cell lines BxPC3 (pancreas), UMSCC14C (head and
neck), A2780 (ovarian), H322 (non-small cell lung), CCRF CEM
(T-cell acute lymphoid leukaemia), U937 (histiocytic
lymphoma) and HL60 (acute myeloid leukaemia (AML)) were used
to measure dCK expression and activity. The variants AG6000
(A2780 resistant to gemcitabine) and CCRF CEM-dCK- (CCRF CEM
resistant to Ara-C) were included to set the lowest level of
dCK activity. The solid tumour cell lines were cultured in
Dulbecco's modification of Eagle's medium (Bio Whittaker
Europe, Verviers, Belgium) supplemented with 5% fetal calf
serum (FCS) (Gibco, Paisley, UK). The others were cultured in
Roswell Park Memorial Institute (RPMI) medium (Bio Whittaker)
with 10% FCS. Subconfluent flasks with cells were harvested
and counted. The cell pellets were frozen at -70 °C until
further analysis.

2.3. Acute myeloid leukaemia samples

Mononuclear blast cells were derived after informed consent
from 4 adult and 12 paediatric patients with acute myeloid
leukaemia as previously described in Ref. [13] and isolated
by density gradient methods. Contaminating non-leukaemic
cells were removed by immunomagnetic beads, resulting in a
percentage of leukaemic cells above 80%, as morphologically
determined by May-Grunwald-Giemsa staining of the cytospins.

2.4. Liver and liver metastases 

Biopsies of liver metastasis and normal liver of patients
with histologically-proven colorectal cancer were used to
study dCK in solid tumours and normal tissue. Samples were
immediately frozen in liquid nitrogen and subsequently stored
at -80 °C. Frozen tissues were pulverised using a
microdismembrator [1]; powdered tissue could be stored for
the different dCK assays.

2.5. Deoxycytidine kinase activity 

dCK activity in each of the cell lines was determined using
3H-CdA as a substrate, since CdA is a better and more
specific substrate than deoxycytidine, which can also be
phosphorylated by thymidine kinase 2. The assay was based on
a method described by Arner and colleagues in Ref. [14]. Cell
pellets (at least 1.5 × 10^7) were resuspended in 50 mM
Tris(hydroxymethyl)-aminomethane (TRIS)/HCl buffer (pH 7.4),
containing 4 mM dithiothreitol (DTT), at a concentration of
30 × 10^6 cells/ml and kept on ice. Frozen tissue was
suspended in 50 mM TRIS/HCl buffer (0.33 g/ml). Cells were
disrupted by sonication and the suspension was centrifuged
for 15 min at 7000g. The supernatant was used for the enzyme
assay and protein determination. We used 25 µl of supernatant
(6-126 µg protein) and added 10 µl 50 mM MgATP/100 mM NaF,
10 µl 250 µM [8-3H]-CdA (specific activity 0.19 µCi/nmol) and
5 µl TRIS/HCl buffer. This mixture was incubated at 37 °C.
The assay was stopped by heating the samples to 90 °C, then
10 µl of 10 mM CdA was added to enable its localisation by
ultraviolet (UV) absorbance on the PEI thin layer sheets. The
samples were centrifuged in order to spin down the protein
and subsequently 5 µl was spotted onto polyethyleneimine
(PEI) cellulose thin-layer chromatography sheets. The sheets
were developed with water to separate substrate and product.
The different spots were cut and put into a vial with 750 µl
of 0.1 M HCl/0.2 M KCl to enable elution of radioactivity
from the PEI cellulose, during 2 h of shaking. After addition
of liquid scintillation fluid, radioactivity was counted.
Enzyme activity was expressed as pmol CdA-MP formed per h per
mg protein.
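
The conversion from scintillation counts to specific activity can be sketched as follows. This is only an illustration under stated assumptions, not the authors' calculation: it assumes the product and substrate spots are counted with equal efficiency, that the fraction of counts in the product spot equals the fraction of the 2500 pmol of CdA (10 µl of 250 µM) converted to CdA-MP, and that the incubation time, which is not stated in the text, is supplied by the user. The function name and the example numbers are hypothetical.

```python
# Substrate present in one assay tube: 10 µl of 250 µM CdA.
# µl × µM = pmol, so this is 2500 pmol.
SUBSTRATE_PMOL = 10 * 250


def dck_specific_activity(product_counts, substrate_counts,
                          substrate_pmol, hours, protein_mg):
    """Return pmol CdA-MP formed per hour per mg protein.

    Assumes equal counting efficiency for both spots, so the share of
    radioactivity in the product spot is the fraction of substrate
    converted.
    """
    total = product_counts + substrate_counts
    if total <= 0 or hours <= 0 or protein_mg <= 0:
        raise ValueError("counts, time and protein must be positive")
    fraction_converted = product_counts / total
    return fraction_converted * substrate_pmol / (hours * protein_mg)


# Hypothetical example: 4% conversion in 0.5 h with 0.05 mg protein.
example = dck_specific_activity(400, 9600, SUBSTRATE_PMOL, 0.5, 0.05)
print(f"{example:.0f} pmol/h/mg protein")
```

Because only the ratio of counts enters the calculation, spotting 5 µl rather than the whole mixture does not change the result.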

2.6. Competitive template RT-PCR 

The isolation of RNA was performed with RNAzol as previously
described in Ref. [13]. Cell pellets were suspended in an
aliquot of 200 µl of RNAzol per 10^6 cells. Twenty micrograms
of the isolated RNA was used for reverse transcription into
cDNA. Random hexamers were used as primers for M-MLV-RT at a
concentration of 0.045 µg/µl. After a brief incubation at
56 °C to remove secondary structures, samples were quickly
cooled down on ice and annealing of the hexamers also took
place on ice. The extension of the primers was at 42 °C using
M-MLV-RT. The reaction was terminated by heating at 95 °C for
5 min. cDNA samples were stored at -20 °C until further use.

The design of the primers and the CT was based on the 
method described by Willey and colleagues in Refs. 
[13,15,16]. One forward (A) and two reverse primers (BC 
and C) were selected by Oligo software requiring an 
optimal annealing temperature of 58 °C, absence of hair- 
pins and no predictable stable primer-dimer formations. 

A: 5'-GAAGGGAACATCGCTGCAGG

C: 5'-TGGCCAAATTGGTTATTCATCC

BC: 5'-TGGCCAAATTGGTTATTCATCCTTGAGCTTGCCATTCAGAGAGG

The primers A and C were used to amplify a native template of
425 bp covering exons 2, 3 and part of 4 of dCK. The primers
A and BC were selected for the construction of the CT. The B
sequence is upstream of C and amplification with A and BC
resulted in a shorter product of 294 bp. This product was the
CT linked by the A and C sequence, which was further purified
and quantified as described in Refs. [13,15]. The β-actin
gene was used as a reference gene and the previously
published construction of the β-actin CT was used [16].
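
The relationship between the primers can be checked with a short sketch. It assumes the sequences are exactly as printed above (C read as the first 22 bases of BC) and that the product lengths of 425 bp and 294 bp are as stated; the variable and function names are illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


PRIMER_C = "TGGCCAAATTGGTTATTCATCC"
PRIMER_BC = "TGGCCAAATTGGTTATTCATCCTTGAGCTTGCCATTCAGAGAGG"

# BC is the C primer followed by the B sequence, so a product made with
# A and BC still carries the C priming site at its end.
assert PRIMER_BC.startswith(PRIMER_C)
SEGMENT_B = PRIMER_BC[len(PRIMER_C):]

# Both are reverse primers, so B anneals where the sense strand reads
# as its reverse complement.
B_SITE_ON_SENSE_STRAND = reverse_complement(SEGMENT_B)

NATIVE_BP = 425
CT_BP = 294
DELETED_BP = NATIVE_BP - CT_BP  # internal stretch missing from the CT

print(SEGMENT_B, B_SITE_ON_SENSE_STRAND, DELETED_BP)
```

The size difference of 131 bp is what allows the native and competitive products to be resolved on the agarose gel while both are amplified by the same primer pair A and C.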

CTs of β-actin and dCK were mixed in a ratio of 10, 100 and
1000 (10^-12 M/10^-13 M; 10^-12 M/10^-14 M; 10^-12 M/10^-15 M)
based on preliminary experiments. One single master mix was
prepared for every cDNA sample containing PCR buffer (1x),
MgCl2 (1.5 mM), dNTPs (200 µM), Taq polymerase (5 µl, final
concentration 0.02 U/µl), sample cDNA and CT mix in a total
volume of 46 µl. dCK and β-actin primers (4 µl, final
concentration 4 ng/µl of each primer) were added to different
tubes, which already contained certain aliquots of the master
mix. Reaction mixtures were overlaid with mineral oil and
cycled in a MJ Research PTC-200 (Biozym, Landgraaf, The
Netherlands). A semi-hot start from ice to 94 °C was
performed and, with 1-min steps of denaturation at 94 °C,
primer annealing at 58 °C and elongation at 72 °C, the PCR
reaction continued for 35 cycles.

PCR products were separated by electrophoresis on 2% agarose
gels containing 0.1 µg/ml ethidium bromide. Quantification
was by digital image analysis using Scion Software. The
concentration of native template molecules in the cDNA
samples was calculated by the ratio of the intensity of
native template/CT and the molarity of the CT mixture as
previously described in Refs. [13,15,16]. Normalisation of
the dCK expression to the expression of β-actin was based on
data obtained from assays with the same master mixture.
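
The calculation described in this paragraph can be sketched as below. This is a simplified illustration, not the procedure of Refs. [13,15,16]: it assumes the band intensity ratio equals the molar ratio of native template to CT, and it omits any correction for heterodimer bands or for fragment length. All function names and the example values are hypothetical.

```python
def native_template_molarity(native_od, ct_od, ct_molarity):
    """Estimate native template molarity from the native/CT band ratio."""
    if ct_od <= 0:
        raise ValueError("CT band intensity must be positive")
    return (native_od / ct_od) * ct_molarity


def dck_per_actin_x1000(dck_native_od, dck_ct_od, dck_ct_molarity,
                        actin_native_od, actin_ct_od, actin_ct_molarity):
    """Return dCK mRNA per beta-actin mRNA, expressed in units of 10^-3."""
    dck = native_template_molarity(dck_native_od, dck_ct_od, dck_ct_molarity)
    actin = native_template_molarity(actin_native_od, actin_ct_od,
                                     actin_ct_molarity)
    return 1000.0 * dck / actin


# Hypothetical example with the 10^-12 M / 10^-14 M CT mixture:
# dCK native/CT = 2.0 and beta-actin native/CT = 0.5.
ratio = dck_per_actin_x1000(2.0, 1.0, 1e-14, 0.5, 1.0, 1e-12)
print(f"dCK/beta-actin = {ratio:.1f} x 10^-3")
```

Because both targets are measured from the same master mixture, differences in cDNA loading cancel in the final ratio.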

The dCK expression in the cell lines was determined at 
least three times on different cDNA samples of one cell 
line. The expression in the AML samples was measured 
once, because these samples were also used for several 
other assays. Levels of expression were reported as units of
dCK mRNA per β-actin mRNA (× 10^-3 molecules).

2.7. Western blotting

Western blots were produced according to a published method
in Ref. [12]. Protein extracts (30 µg) were electrophoresed
on 12.5% sodium dodecyl sulphate (SDS)-polyacrylamide gels
and transferred by electroblotting to nitrocellulose
membranes (Hybond ECL membranes, Amersham, Buckinghamshire,
UK). After overnight incubation in blocking buffer (5% bovine
serum albumin in TRIS-buffered saline with 0.1% Tween 20),
the blot was probed with rabbit anti-human dCK polyclonal
antibody (1:5000, 1 h at room temperature) and goat
anti-rabbit secondary antibody conjugated to horseradish
peroxidase (1:40,000, 1 h at room temperature). Purified
recombinant dCK (10 ng), extracts of AG6000 cells lacking dCK
and parental A2780 ovarian cancer cells were tested on every
blot to identify the dCK band. Levels of relative expression
were determined by densitometry scanning (Imaging
Densitometer, model GS-690, Bio-Rad) of the X-ray films
(Hyperfilm ECL). dCK protein levels were expressed as ng dCK
per mg total protein.

2.8. Statistics 

The Spearman correlation test was used to quantify the 
relationship between dCK activity and dCK expression 
at the mRNA and protein level. Student's t-test was used
for comparison of paired and unpaired samples. 
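
For readers unfamiliar with the Spearman test, a minimal pure-Python sketch of the rank correlation coefficient is given here. It is not the software used in the study, which is not named; it computes the coefficient as the Pearson correlation of average ranks and does not compute a P value. The input values in the example are made up.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_r(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)


# Made-up activity and expression values for five samples.
print(round(spearman_r([1, 2, 3, 4, 5], [5, 6, 7, 8, 7]), 3))
```

Working on ranks rather than raw values makes the coefficient insensitive to the large differences in scale between activity, protein and mRNA measurements.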



3. Results 

The panel of cell lines had a 10-fold variation in dCK
activity. The lower level was set by CCRF-CEM/dCK- and AG6000
(1.2 and 1.4 pmol/h per mg protein, respectively). The mRNA
expression showed a 19-fold variation for the cell lines
(Table 1). No dCK mRNA could be detected in CCRF-CEM/dCK-.
AG6000 showed a normal and a truncated mRNA product, as has
been previously published [6]; this complicated accurate
quantification and this cell line was excluded from the mRNA
expression evaluation (Fig. 1). The reproducibility of the
CT-RT-PCR assay in the cell lines is given in Table 1 as the
relative standard deviation.

Table 1

dCK mRNA expression in cell lines and AML samples(a)

Sample        dCK mRNA expression              Relative standard
              (ratio ×10^-3 × dCK/β-actin)     deviation (%)

Cell line
CCRF CEM      38.0 ± 17.3                      33
HL60          56.3 ± 19.9                      26
U937          56.9 ± 6.60                      7
A2780         6.06 ± 6.76                      15
UMSCC14C      4.73 ± 0.38                      10
BxPC3         2.97 ± 0.16                      9
H322          5.97 ± 1.54                      6

AML
(UM*          4.24
4695*         1.87
4779          3.12
6250*         10.79
7470          26.30
7578          35.23
7621          19.40
7747          1.16
7985          2.77
8019          12.27
8051          9.88
8069          3.52
8196          26.17
8400*         11.39
8773          3.00
8774          4.40

AML, acute myeloid leukaemia.

(a) mRNA expression is the mean ± standard error of the mean
(S.E.M.) of at least three measurements on different samples for the
cell lines. The relative standard deviation (S.D.) is based on 2-3
measurements on the same sample. (*) Indicates adult AML samples, the
others are from childhood AML; not enough clinical material was
available to measure enzyme activity. No expression was measured in
the Ara-C- and gemcitabine-resistant cell lines.

Fig. 1. Representative X-ray photos of β-actin and dCK gene and
competitive template (CT) products of the cell lines and 2 AML
samples (7470 and 7578). The blots show three bands for both dCK and
β-actin: 532 and 425 are the number of base pairs encoded by the
forward and reverse primers for the native cDNA of β-actin and dCK,
respectively; 415 and 294 are the number of base pairs encoded by the
competitive templates for β-actin and dCK, respectively. The
intermediate band is a heterodimer (HD) which can be formed between
the native cDNA and the competitive template. The contribution of the
HD was calculated as previously described in Refs. [13,16]. The bands
were scanned and the optical density (OD) was used to calculate a
ratio between the native cDNA and CT of each target; thereafter these
ratios were used to calculate the ratio for dCK and β-actin.



The AML samples had a 30-fold range of dCK mRNA
expression that overlapped with that of the cell lines,
although the high level of expression in the cell lines 
HL60 and U937 was not seen in the patient samples. 

Liver metastases contained a mean 2.5-fold higher dCK enzyme
activity than the surrounding normal liver samples (10.1 ± 1.9
and 4.0 ± 0.4 nmol/h/mg protein, respectively; P = 0.004,
Student's t-test for paired data) (Fig. 2). The range of dCK
activities in the liver metastases was about 17-fold
(1.4-23.8 nmol/h per mg protein), while that in the liver
ranged only 3-fold (2.0-6.7). The dCK mRNA expression showed
a wide variation for these samples: a 150-fold difference
(0.53-80.4). The subgroup of liver samples had a 36-fold
range (0.63-22.7) in dCK mRNA expression levels.
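
The fold ranges quoted in this paragraph follow directly from the reported minimum and maximum values, as the short sketch below illustrates. The function name is illustrative, and only the extremes given in the text are used.

```python
def fold_range(lowest, highest):
    """Return the fold difference between the highest and lowest value."""
    if lowest <= 0:
        raise ValueError("lowest value must be positive")
    return highest / lowest


reported_ranges = {
    "dCK activity, liver metastases": (1.4, 23.8),
    "dCK activity, liver": (2.0, 6.7),
    "dCK mRNA, all liver and metastasis samples": (0.53, 80.4),
    "dCK mRNA, liver": (0.63, 22.7),
}

for label, (low, high) in reported_ranges.items():
    print(f"{label}: {fold_range(low, high):.1f}-fold")
```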

The correlation between dCK activity and dCK mRNA levels in
the cell lines is depicted in Fig. 3a. The correlation
coefficient determined by Spearman rank correlation was
r = 0.75 (P = 0.026). The correlation between these dCK
measurements in pooled liver and liver metastases is
r = 0.416 (P = 0.024). The correlation was also observed for
liver metastases only (Fig. 2) (r = 0.497, P = 0.050), but
not for the subgroup of livers.

Fig. 2. Activity of dCK, amount of dCK protein and dCK mRNA
expression in normal livers and liver metastases (meta) from colorectal
cancer. The dCK activity in liver metastases was significantly higher
than in livers (P = 0.004). In livers, no correlation was found between
dCK activity and dCK mRNA, while dCK activity was related to dCK
protein (r = 0.59; P = 0.013). However, in liver metastases, dCK
activity was correlated with dCK mRNA (r = 0.497; P = 0.05), but not
to dCK protein. One liver metastasis sample is not shown having 72.9 ng
dCK/mg protein and 80.4 as dCK mRNA expression. One liver sample
had a dCK mRNA of 22.7 and a dCK activity of 5 nmol/h/mg protein.

Fig. 3. (a) Correlation between mRNA expression and dCK activity
in a panel of seven cell lines of different origin. Correlation
coefficient r = 0.75 (P = 0.026). (b) Correlation between mRNA
expression and dCK protein expression in a panel of seven cell lines
of different origin. Correlation coefficient r = 0.79 (P = 0.018).

The dCK protein expression in the cell line panel (Figs. 3b
and 4) varied about 7.2-fold. The correlation with dCK mRNA
(Fig. 3b) was r = 0.85 (P = 0.007), as determined by Spearman
rank correlation. In addition, the correlation coefficient of
protein expression and activity was significant (r = 0.79,
P = 0.018, Spearman).

In the liver metastases, dCK protein levels were clearly
higher and had a broader range compared with the normal liver
samples (5.3-72.9 and 0.5-7.4 ng dCK/mg protein,
respectively) (Fig. 2). A correlation between dCK activity
and protein was found in the liver samples (r = 0.59,
P = 0.013), but not in the metastases. There was no
correlation between dCK mRNA and protein expression in these
samples.

Fig. 4. Western blot of dCK protein in the different cell lines
(1, 14C; 2, A2780; 3, AG6000; 4, BxPC3; 5, CEM; 6, H322; 7, HL60;
8, U937; 9, dCK-HIS).



4. Discussion 

The expression of dCK mRNA in a panel of solid 
tumour cell lines and leukaemia and lymphoma cell lines 
was closely correlated to the activity of this enzyme. This 
relationship had already been established for a panel of 
six leukaemia cell lines by Kawasaki and colleagues in 
Ref. [17]. For solid tumour cells of different histological
origin such a relationship had not yet been established.
Any such correlation is of major interest because
of the important role of some novel deoxynucleoside
analogues, such as gemcitabine, in the treatment of solid
tumours. Our panel is a mixture of both leukaemic and 
solid tumour cells, and if we considered only the four 
solid tumour cell lines, the relationship was weaker. The 
correlation between dCK activity and dCK mRNA 
expression in liver metastases was also weaker and its 
statistical significance was borderline (P = 0.05).

The substrate CdA, which we used, is also a substrate for
deoxyguanosine kinase (dGK), albeit with a much higher Km
(85 µM) and lower efficiency than for dCK (5 µM) [18,19]. The
cell lines CCRF-CEM-dCK- and AG6000 have some CdA
phosphorylating capacity, but they do not show a dCK protein
band in western blotting. The activity measured here is most
likely due to dGK activity, which is also present in the cell
extract. The positive relationship between dCK enzyme
activity and protein expression has also been observed in
studies of Kawasaki [20] and Spasokoukotskaja [21]. The
variation in the dCK mRNA expression in the AML samples
(30-fold) is somewhat lower than that found in a
study with 13 childhood AML samples (50-fold) [22] 
and 35 childhood ALL and AML samples [23] in which 
samples from relapsed leukaemic cells had a lower dCK 
expression. In addition, for the enzyme activities, a 
comparable range in dCK levels was found [24]. 

The relationship between dCK activity and response to
treatment with deoxynucleoside analogues has been
investigated in several malignancies. Hairy cell leukaemia,
which is very sensitive to CdA, showed both high dCK and dGK
activity. The degree of total CdA phosphorylation correlated
with the response to CdA treatment of both hairy cell
leukaemias and chronic lymphocytic B-cell leukaemias,
although other factors also seemed to be involved [3,25]. In
AML, dCK activity correlated with the response to Ara-C
treatment in 21 patients [26]. ALL patients more often
relapsed when dCK expression was low or absent [27].
Transfection experiments with dCK and other deoxynucleoside
kinase [28] genes have underlined the important role of dCK
in the antitumour activity of deoxynucleoside analogues.
Increased expression of the dCK gene by transfection restored
and enhanced the sensitivity to deoxynucleoside analogues
that are activated by dCK [8,9,29], both in vitro and in
vivo.

Most of the AML samples were derived from childhood AMLs
[13], which are always treated with an Ara-C-containing
protocol. The large range of dCK mRNA expression, which
overlapped with the expression of the cell lines, indicates
that this disease in general has an intrinsic sensitivity to
Ara-C, but that individual differences in sensitivity to
Ara-C exist. Moreover, this has been reported and has
prognostic significance [30,31].
Recently, a clinical protocol including Ara-C was started 
in which dCK expression will be evaluated prospectively. 

Gemcitabine is currently being used for the treatment 
of many solid malignancies. Single agent gemcitabine
shows little activity in colorectal cancer, but it is not 
clear whether this is due to a low dCK expression. 
Therefore, we designed this study to determine dCK
expression in liver metastases. No information is cur- 
rently available on the relationship between response 
and dCK activity in patients. In cell lines, we previously 
reported a relationship between gemcitabine sensitivity 
and dCK activity [6,7], while in xenografts [5], we also 
observed a relationship between gemcitabine sensitivity 
and dCK activity. dCK gene transfer enhanced the 
antitumour effect of gemcitabine on tumour xenografts 
[9], confirming the relationship between dCK and gem- 
citabine activity. It is remarkable that the dCK activity 
in the liver samples in our study showed relatively small
variations and did not exceed a level of 5 nmol/h/mg 
protein. In contrast, protein and mRNA expression 
levels showed more variation. In liver metastases, both 
activity and mRNA varied considerably and correlated 
with one another. This would give an advantage for 
liver metastases compared with liver in selectively acti- 
vating deoxynucleoside analogues in liver metastases. A 
high dose of gemcitabine is expected to lead to selective 
activation in metastases, and when given via a hepatic 
artery infusion would not leak to the systemic circula- 
tion as livers have a high deaminase activity. The much 
smaller range of dCK activity in the liver samples com- 
pared with the liver metastases also indicated that in the 
liver a posttranslational regulation mechanism exists, 
which seems to be deregulated in the liver metastases. It
has been reported that dCK activity can be regulated by its
phosphorylation [32,33]. dCK purified from leukaemic blasts
can be phosphorylated by protein kinase C (PKC), resulting in
a 100% increase in enzyme activity [32]. However, recombinant
dCK is a poor substrate for PKC; it is effectively
phosphorylated by protein kinase A, but this is not
accompanied by an increase in enzyme activity [33]. An
enhanced dCK activity could be eliminated by protein
phosphatase treatment of lymphoid cells, supporting a
secondary modification of the dCK protein [34]. In addition,
the promoter for dCK contains several potential regulatory
transcription factor sites [35,36], although it is not yet
clear whether this regulation is tissue-specific or
deregulated in tumours.

For the enzyme activity assays and western blotting,
relatively large pieces of tissue or many cells are required,
and these are not always available. The current PCR assay
detects a wide range of dCK mRNA levels. CT-RT-PCR can be
performed with any PCR equipment, and does not require an
expensive TaqMan or LightCycler apparatus. In addition,
results from this type of assay show a high inter- and
intralaboratory agreement [37]. Measurement of dCK mRNA
expression offers the possibility for larger-scale studies,
in patients with solid tumours and leukaemia, to determine a
relationship between dCK expression and response to either
gemcitabine or Ara-C. These data should be followed by a
prospective study to select patients for deoxynucleoside
analogue treatment based on their dCK activity.
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Expression of deoxycytidine kinase in leukaemic cells compared with, solid 
tumour cell lines, liver metastases and normal liver. 

van der Wilt CL, Kroep JR, Loves WJ, Rots MG, Van Groeningen CJ, Kaspers GJ,
Peters GJ.

Department of Medical Oncology, VU University Medical Center, Amsterdam, The 
Netherlands. 

Deoxycytidine kinase (dCK) is required for the phosphorylation of several
deoxyribonucleoside analogues that are widely employed as chemotherapeutic agents.
Examples include cytosine arabinoside (Ara-C) and 2-chlorodeoxyadenosine (CdA) in
the treatment of acute myeloid leukaemia (AML) and gemcitabine to treat solid tumours.
In this study, expression of dCK mRNA was measured by a competitive template reverse
transcriptase polymerase chain reaction (CT RT-PCR) in seven cell lines of different
histological origin, 16 childhood and adult AML samples, 10 human liver samples and 11
human liver metastases of colorectal cancer origin. The enzyme activity and protein
expression levels of dCK in the cell lines were closely related to the mRNA expression
levels (r=0.75, P=0.026 and r=0.86, P=0.007). In AML samples, dCK mRNA expression
ranged from 1.16 to 35.25 (×10^-3 × dCK/β-actin). In the cell line panel, the range was
2.97-56.9 (×10^-3 × dCK/β-actin) of dCK mRNA expression. The enzyme activity in
liver metastases was correlated to dCK mRNA expression (r=0.497, P=0.05). In the liver
samples, these were not correlated. dCK mRNA expression showed only a 36-fold range
in liver while a 150-fold range was observed in the liver metastases. In addition, dCK
activity and mean mRNA levels were 2.5-fold higher in the metastases than in the liver
samples. Since dCK is associated with the sensitivity to deoxynucleoside analogues and
because of the good correlation between the different dCK measurements in malignant
cells and tumours, the CT-RT PCR assay will be useful in the selection of patients that
can be treated with deoxycytidine analogues.
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Abstract 

To further explore the pathophysiological significance of arachidonic acid-sensitive potassium channels, RT-PCR and Western
blot analysis were used in this study to investigate the expression changes of TREK channels in cortex and hippocampus in rat
experimental acute cerebral ischemia. Results showed that TREK-1 and TRAAK mRNA in cortex, and TREK-1 and TREK-2 mRNA
in hippocampus, increased significantly 2 h after middle cerebral artery occlusion (MCAO), while the mRNA expression levels
of all three channel subtypes increased significantly 24 h after MCAO in cortex and hippocampus. At the same time, the protein
expression of all three channel proteins showed a significant increase 24 h after MCAO in cortex and hippocampus, but only
TREK-1 showed increased expression 2 h after MCAO in cortex and hippocampus. Immunohistochemical experiments verified that
all three channel proteins had higher expression levels in cortical and hippocampal neurons 24 h after MCAO. These results
suggested a strong correlation between TREK channels and acute cerebral ischemia. TREK channels might provide a
neuroprotective mechanism in the pathological process.
© 2004 Elsevier Inc. All rights reserved. 

Keywords: TREK; Potassium channels; Neuroprotection; Cerebral ischemia



The tandem-pore domain potassium channels (K2P) constitute a
large subfamily of the potassium channel family. Since the
identification of its first member in 1996 [1], the channel
family has been elucidated further. Altogether 18 K2P genes
from the human genome have been cloned, 17 of which have been
proved to be functional. According to their amino acid
sequence identity and regulation mechanisms, K2P can be
divided into six main classes [2]. TREK (the TWIK-related
arachidonic acid-stimulated and mechano-gated potassium
channels), including TREK-1, TREK-2, and TRAAK, is one class.
They have a distinctive structural motif of four
transmembrane segments and two pore domains (4TMS/2P) which
markedly differs from the structures of classic potassium
channels: the 6TMS/1P of voltage-dependent potassium channels
(Kv) and 2TMS/1P of inward-rectifying potassium channels
(Kir). In natural and recombinant cells, their currents are
simultaneous, voltage- and time-independent, and
non-inactivating. Together with other K2P, they produce the
baseline or leak current, forming the cellular membrane
resting potential [3].

* Corresponding author. Fax: +86 10 6301 7757.
E-mail address: Wungxl@imm.ui' .en (X.-L. Wang).

0006-291X/$ - see front matter © 2004 Elsevier Inc. All rights reserved.

Studies have shown that TREK channels are specifically
expressed in the central nervous system [4].
Electrophysiological experiments showed that they can also be
strongly activated by multiple microenvironment parameters in
vivo, including polyunsaturated fatty acids (PUFAs) such as
arachidonic acid (AA), while being insensitive to the classic
potassium channel antagonists like TEA, 4-AP, and Cs+. They
are also sensitive to intracellular pH changes and body
temperature changes [5,6]. TREK-1 and TREK-2 can be strongly
activated by intracellular acidosis, while TRAAK can be
activated by intracellular alkalosis [7]. Cell volume, in
vitro, can also influence their channel activity. The
negative pressure applied to the neurons in vitro that mimics
the cell volume increase in edema may lead to channel
activation [8]. All the evidence suggests that TREK might
play important roles in conditions such as cerebral ischemia
and inflammation.

However, the exact functions of TREK channels in such
pathological conditions are still not clear. In our previous
study, it was found that 3 and 30 days after permanent
bilateral carotid artery ligation (BCAL), TRAAK mRNA
expression increased significantly in rat cortex, while
TREK-2 and TRAAK mRNA increased significantly in hippocampus
[9]. This indicated that they were involved in chronic
cerebral ischemia.

To further elucidate their pathological roles in acute 
cerebral ischemia, the present study examined the 
mRNA and protein expressions, by RT-PCR and Wes- 
tern blot analysis, in experimental acute cerebral ische- 
mia in rat cortex and hippocampus. Their histological 
expression in vivo has also been investigated by immu- 
nohistochemistry. By studying their involvement in 
acute cerebral ischemia in vivo for the first time, this 
study provided direct evidence for their possible neuro- 
protective effect in acute cerebral ischemia. 



Materials and methods 

Experimental acute cerebral ischemia. Adult male Wistar rats
weighing 260-300 g were used and were provided by the Animal Center
of Chinese Academy of Medical Sciences, Beijing, China. The MCAO model
was used to produce rat acute cerebral ischemia [10]. Rats were divided
into sham, ischemia for 2 h and ischemia for 24 h groups (n = 5). Briefly,
rats were anesthetized and placed in a supine position. The common 
carotid (CCA), external carotid (ECA), internal carotid (ICA), and 
pterygopalatine (PPA) arteries were exposed and ligated except ICA. 
The cerebral ischemia was produced by inserting a 0.28 mm monofila- 
ment suture with a flamed tip about 18-20 mm into ICA from the 
bifurcation of the ICA and ECA, thus occluding the ostium of the 
middle cerebral artery (MCA). Sham group rats received the same 
operation without inserting filament in the MCA. All animals were 
treated in accordance with the guidelines established by the Experi- 
mental Animal Committee of Chinese Academy of Medical Sciences. 

Reverse transcription polymerase chain reaction. Analyses of
mRNA expressions of TREK-1, TREK-2, and TRAAK in whole areas
of rat cortex and hippocampus were performed by the reverse transcription
polymerase chain reaction (RT-PCR) method according to procedures
described previously with modifications [9]. Total cortex and
hippocampus RNA was isolated as instructed by the TRIzol Reagent
guidelines (Invitrogen, Carlsbad, CA). For cDNA synthesis, a 15 µL
reaction mixture containing total RNA 2 µg and random primers 1 µg
was heated to 70 °C for 5 min. Then dNTP 12.5 mmol, RNasin 25 IU,
and M-MLV reverse transcriptase 200 IU (Promega, Madison, WI)
were added to a 25 µL reverse transcription mixture and incubated at
37 °C for 1 h after homogenization. PCR amplification was carried out
with 5 µL cDNA product in a 50 µL reaction volume containing dNTP
10 mmol, Taq DNA polymerase 2.5 IU (TAKARA Biotechnology,
Dalian), and specific oligonucleotide primers 20 pmol. The primer
pairs for TREK-1 were 5'-ACAGAACTTCATAGCCCAGCAT-3'
and 5'-TCCCACCTCTTCCTTCGTCT-3'; those for TREK-2 were
5'-CGTCCATCGTCCCAAATT-3' and 5'-TTGGAGGAGTTTCCTACCG-3';
those for TRAAK were 5'-AACTGGTTGCGAGCGGTGTC-3' and
5'-GGGCTTCTTCGTTGGGTTGG-3'; and those for β-actin were
5'-CATCTCTTGCTCGAAGTCCA-3' and 5'-ATCATGTTTGAGACCTTCAACA-3'
(TAKARA Biotechnology). PCR was performed for 28 or 30 cycles of
94 °C for 1 min, 57 °C for 45 s, and 72 °C for 1 min for β-actin and
TREK-1, respectively, and 30 cycles of 94 °C for 1 min, 60 °C for
45 s, and 72 °C for 1 min for TREK-2 and TRAAK. For all the
reactions, preliminary experiments were performed to determine the
appropriate PCR cycles before they reached the balance stage. Five
microliters of PCR product were separated by electrophoresis and
visualized after ethidium bromide staining under UV light,
photographed, and analyzed with the Kodak 1D 3.5 system.
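
The cycling conditions above can be written down as a small data structure, which makes the differences between the targets easy to compare. This is only an illustrative sketch: the dictionary layout and function name are assumptions, and the computed times cover the programmed hold steps only, ignoring ramping between temperatures and any initial or final steps not described in the text.

```python
# Each step is (temperature in °C, hold time in seconds).
STANDARD_57 = [(94, 60), (57, 45), (72, 60)]
STANDARD_60 = [(94, 60), (60, 45), (72, 60)]

PROGRAMS = {
    "beta-actin": {"cycles": 28, "steps": STANDARD_57},
    "TREK-1": {"cycles": 30, "steps": STANDARD_57},
    "TREK-2": {"cycles": 30, "steps": STANDARD_60},
    "TRAAK": {"cycles": 30, "steps": STANDARD_60},
}


def cycling_minutes(program):
    """Return total programmed hold time in minutes (ramp times excluded)."""
    seconds_per_cycle = sum(hold for _, hold in program["steps"])
    return program["cycles"] * seconds_per_cycle / 60


for target, program in PROGRAMS.items():
    annealing = program["steps"][1][0]
    print(f"{target}: {program['cycles']} cycles, "
          f"annealing at {annealing} °C, {cycling_minutes(program)} min")
```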

Western blot analysis. Protein expression levels of the three
TREK channel members were determined by the DAB method as described
earlier [11]. Rat cerebral cortex and hippocampus membrane proteins
were prepared by homogenization in lysis buffer (Tris-HCl 5 mM,
sucrose 0.32 M, pepstatin A 1 µg/mL, PMSF 0.2 mM, aprotinin
1 µg/mL, and leupeptin 1 µg/mL, pH 7.4) (Sigma-Aldrich, St. Louis,
MO); the lysates were then centrifuged at 1000g for 10 min at 4 °C.
The supernatants were further centrifuged at 100,000g for
60 min at 4 °C. The resulting crude pellet was resuspended in the
same solution and protein concentration was determined by the Bradford
method. All procedures were carried out on ice to prevent proteolysis.

For electrophoresis, the equivalent of 100 µg protein was applied
to each lane of a 10% SDS-PAGE gel and run at standard conditions.
The proteins were then electroblotted onto nitrocellulose membrane
(0.45 µm, Amersham, Buckinghamshire) at 1 mA/cm² for 2.5 h
with a semi-dry apparatus (Liuyi, Beijing). Membranes were blocked in 5%
non-fat dried milk in TBST (100 mM Tris-HCl, 154 mM NaCl, and
0.05% Tween 20 (v/v), pH 7.5) for 2 h at room temperature with
agitation, and then incubated with polyclonal primary antibodies
against TREK-1, TREK-2, TRAAK, and β-actin (1:1000 dilution in
TBST, Santa Cruz Biotechnology, CA) at 4 °C overnight. After that,
the membranes were incubated with secondary antibodies (rabbit
anti-goat IgG conjugated to horseradish peroxidase, HRP, 1:5000 dilution,
Santa Cruz) at room temperature for 2 h. The antigen-antibody
complexes were visualized with 3,3'-diaminobenzidine
tetrahydrochloride (DAB, Research Genetics, Huntsville, AL), and
photographed and analyzed with the Kodak 1D 3.5 system. All data were
expressed as ratios to β-actin protein. The specificity of primary
antibodies was tested by pre-absorption with the antigenic peptide.
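
The normalization step described above (expressing each band as a ratio to β-actin) can be sketched as follows. The intensity values are hypothetical placeholders, not data from this study, and the function name is introduced only for illustration.

```python
# Sketch: express target band intensities as ratios to the beta-actin band
# from the same lane, then summarize as mean ± SD.
# All numbers below are hypothetical and are NOT taken from the paper.
from statistics import mean, stdev

def normalize_to_reference(target, reference):
    """Divide each target intensity by the matching reference intensity."""
    if len(target) != len(reference):
        raise ValueError("target and reference must have the same length")
    return [t / r for t, r in zip(target, reference)]

# Hypothetical densitometric readings (arbitrary units), one per lane.
target_band = [120.0, 130.0, 110.0]
actin_band = [400.0, 420.0, 380.0]

ratios = normalize_to_reference(target_band, actin_band)
print(f"target/actin = {mean(ratios):.3f} ± {stdev(ratios):.3f}")
```

Normalizing lane by lane, rather than dividing group means, keeps loading differences between lanes from inflating the apparent variance.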

Immunohistochemistry. Immunohistochemistry was performed on
cryosections as described previously [12]. Rats of the sham group and
of the 24 h after MCAO group were thoroughly anesthetized and
transcardially perfused with 4% paraformaldehyde. Next, the brains were
removed, post-fixed in 4% paraformaldehyde at 4 °C for 4 h, and
transferred to 30% sucrose and 4% paraformaldehyde solution at 4 °C
for 48 h until the brain sank to the bottom. After being frozen in OCT
compound (Jung, Leica, Germany) for 10 s in liquid nitrogen, the
brains were cut into 40 µm coronal sections in a cryostat (CM 1900,
Leica). Sections were kept successively in 3% hydrogen peroxide for
15 min to quench the endogenous peroxidase activity and in 5% rabbit
serum for 30 min to block. They were then incubated with primary
antibodies (1:50 dilution, Santa Cruz Biotechnology) for 2 h at 37 °C
and washed in PBST three times for 10 min. Following this, sections
were incubated with HRP-conjugated secondary antibodies (1:100
dilution, rabbit anti-goat IgG, Santa Cruz Biotechnology) at room
temperature for 2 h with gentle agitation. Sections were then
visualized with DAB and slide-mounted, dried, defatted, and coverslipped.
The specificity of the antibodies was assessed by pre-absorption with
the antigenic peptide. For quantification, five positively stained
neurons were randomly selected and analyzed with Image Pro Plus
software.

Statistical analysis. All RT-PCR, Western blot, and
immunohistochemical data were analyzed with one-way
ANOVA tests. Results were expressed as means ± SD, and
significance was set at p < 0.05.
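
For readers unfamiliar with the test named above, the following is a minimal sketch of how a one-way ANOVA F statistic is computed for several groups. The three groups are hypothetical numbers chosen so the arithmetic is easy to follow; they are not data from this study, and the function is an illustration rather than the authors' software.

```python
# Sketch: one-way ANOVA F statistic from first principles.
# The example groups are hypothetical and NOT taken from the paper.

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of sample groups."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Variation of group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, group_means))
    # Variation of observations around their own group mean.
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, group_means) for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical sham, 2 h, and 24 h groups.
f_stat, df_b, df_w = one_way_anova_f([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

Converting F to a p value requires the F distribution; in practice a statistics library (for example `scipy.stats.f_oneway`) would normally be used instead of hand-rolled code.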
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Results 



mRNA expression changes of TREK-1, TREK-2, and 
TRAAK in cortex and hippocampus after MCAO 

mRNA expression changes of TREK-1, TREK-2, and TRAAK were
detected in both cortex and hippocampus 2 and 24 h after MCAO.
The specificity of RT-PCR products was confirmed by agarose gel
electrophoresis, which yielded single product bands of the
expected sizes in all cases. Fig. 1 shows that in cortex,
TREK-1 and TRAAK mRNA expression increased significantly 2 h
after MCAO, while all three channel mRNAs showed significantly
enhanced expression 24 h after MCAO compared with the sham
group. In hippocampus, TREK-1 and TREK-2 mRNA expression
increased significantly both 2 and 24 h after MCAO, whereas
TRAAK mRNA showed enhanced expression only 24 h after MCAO.
The results also showed that the mRNA expression levels of the
three channels in hippocampus were lower than those in cortex.

Protein expression changes of TREK-1, TREK-2, and 
TRAAK in cortex and hippocampus after MCAO 

To detect the expression changes of TREK-1, TREK-2, and TRAAK,
membrane proteins were isolated from whole cortex and
hippocampus, so the expression changes of the channel proteins
represent their expression across the whole areas. Expression of
the three channel proteins was measured by Western blot analysis
2 and 24 h after MCAO. Fig. 2 shows that 24 h after MCAO, all
three channel proteins showed significantly increased expression
in cortex, indicating that more functional channel proteins had
been synthesized. Only TREK-1 showed higher expression 2 h after
MCAO, while TREK-2 and TRAAK did not show statistically
significant changes, presumably because of the time delay between
gene activation and protein expression. Likewise, all three
proteins showed significantly increased expression in hippocampus
24 h after MCAO, while 2 h after MCAO only TREK-1 channel
protein showed a significant increase in hippocampus.

Histological expressions of TREK-1, TREK-2, and
TRAAK channel proteins in cortical and hippocampal
neurons 24 h after MCAO

In the study, the expression changes of TREK-1,
TREK-2, and TRAAK in the PCg region of cortex and the
CA1 region of hippocampus were analyzed because
these two regions are most vulnerable to ischemic insults
in the CNS. Immunohistochemical experiments showed that

Fig. 1. Expression changes of TREK-1, TREK-2, and TRAAK mRNA in cortex and hippocampus 2 and 24 h after MCAO. Sham, 2, and 24 h
represent sham group, 2, and 24 h after MCAO, respectively. (A) Representative ethidium bromide-stained agarose gel of RT-PCR products of
TREK-1, TREK-2, TRAAK, and β-actin of cortex after electrophoresis. Five microliters of RT-PCR products were loaded in each lane.
(B) Quantification of the relative expression levels of TREK-1, TREK-2, and TRAAK in cortex after acute MCAO. (C) Representative ethidium
bromide-stained agarose gel of RT-PCR products of TREK-1, TREK-2, TRAAK, and β-actin of hippocampus after electrophoresis. (D)
Quantification of the relative expression levels of TREK-1, TREK-2, and TRAAK in hippocampus after MCAO. Data were acquired by image
analysis of the RT-PCR bands and were ratios compared with β-actin. Each value represents the mean densitometry value of three repeated
RT-PCRs. n = 5 for each group. *p < 0.01 compared with the sham group.

Fig. 2. Expression changes of TREK-1, TREK-2, and TRAAK channel proteins in cortex and hippocampus 2 and 24 h after MCAO. Sham, 2, and
24 h represent sham group, 2, and 24 h after MCAO, respectively. (A) Representative Western blots of TREK-1, TREK-2, TRAAK, and β-actin
proteins detected by DAB staining in cortex. (B) Quantification of Western blot signals of TREK-1, TREK-2, and TRAAK in cortex after MCAO.
(C) Representative Western blots of TREK-1, TREK-2, TRAAK, and β-actin proteins detected by DAB staining in hippocampus. (D)
Quantification of Western blot signals of TREK-1, TREK-2, and TRAAK in hippocampus. All data were ratios compared with β-actin protein.
Each value represents the mean of three membrane preparations normalized to the densitometric value. Values are means ± SD. n = 5 for each
group. *p < 0.01 compared with the control group.



in both of these regions, fewer positively stained neurons
were detected 24 h after MCAO than in the sham
group, because many had died following MCAO
surgery (Fig. 3). But for the remaining neurons in both
PCg and CA1 regions, significantly increased signals of
TREK-1, TREK-2, and TRAAK proteins were detected
24 h after MCAO, as shown in Table I.



Discussion 

Stroke, including occlusive stroke, is one of the leading
causes of death and of dependence in activities of daily
living in the world. It can result in severe neuronal
degeneration and consequent loss of brain functions.
Recent developments have further advanced our knowledge
of the cellular and molecular mechanisms, and
have confirmed that cerebral ischemia gives rise to
widespread alteration of gene expression and regulation.
However, the pathophysiological process and the pro- 
tective mechanisms following brain injury induced by 
ischemia are extremely complex and have not been fully 
elucidated. 

TREK-1, TREK-2, and TRAAK belong to the TREK
class of the K2P family. Previous studies have shown
that they are highly expressed in regions prone to
ischemia-induced cell death such as cortex, hippocampus,
and amygdala, but expressed at much lower levels in
regions like spinal cord, brainstem, and substantia nigra
that are evolutionarily more ancient [4]. Animal
models of transient forebrain ischemia have shown that
cortex and hippocampus are very vulnerable to ischemic
stroke, especially hippocampus, and that within the
hippocampus, pyramidal cells in the CA1 region are most
susceptible to ischemic insults. TREK-1, TREK-2, and
TRAAK can be strongly activated by pathological
microenvironmental changes such as membrane distortion
and intracellular pH increase or decrease [13]. Since these
pathological changes usually occur in cerebral ischemia,
the above-mentioned channel subtypes are suggested to
be involved in the pathological process.

When cerebral ischemia occurs, the activation of
phospholipase A2, induced by intracellular Ca2+ release,
results in degradation of membrane phospholipids and
the accumulation of unsaturated fatty acids like AA, as
well as lysophosphatidylcholine (LPC) and
lysophosphatidic acid (LPA) [14]. At the same time, the
oxygen- and glucose-deprived condition in brain can lead to
increased glycolysis and a consequent rise in lactate
concentration, resulting in intracellular acidosis. Also, the

Fig. 3. Immunohistochemical analysis of TREK-1, TREK-2, and TRAAK channel protein expressions in cortical and hippocampal neurons 24 h
after MCAO. Sham and 24 h represent sham group and 24 h after MCAO, respectively. (A) Positive neurons detected by DAB staining in cortical
cryosections of PCg area. (B) Positive neurons detected by DAB staining in hippocampal cryosections of CA1 area. Scale bar = 0.01 mm.



Table I

Enhanced expressions of TREK channel proteins in cortical and
hippocampal neurons 24 h after MCAO

Channel protein     Sham              24 h

(A)
TREK-1              83.99 ± 7.33      63.95 ± 1.83*
TREK-2              75.12 ± 5.28      55.59 ± 2.92*
TRAAK               92.32 ± 5.60      70.07 ± 1. US*

(B)
TREK-1              123.67 ± 5.30     89.46 ± 1.56*
TREK-2              116.21 ± 4.37     79.26 ± 3.43*
TRAAK               102.63 ± 4.18     81.67 ± 2.02*

Sham and 24 h represent sham group and 24 h after MCAO,
respectively. Five positively stained neurons were randomly selected and
analyzed in each cortical or hippocampal section. Each value
represents the mean of five positively stained neurons normalized to the
densitometric value. (A) Analysis results of cortical sections. (B)
Analysis results of hippocampal sections. Values are means ± SD.
n = 3 for each group.

* p < 0.01 compared with the control group.

edema caused in ischemia mimics the function of negative
mechanical stretch. All these factors are potent
activators of TREK-1, TREK-2, and TRAAK in the case of
acute cerebral ischemia (intracellular acidosis can
activate TREK-1 and TREK-2, but not TRAAK). In our
experiments, TREK-1, TREK-2, and TRAAK showed
enhanced expression both in cortex and hippocampus
at mRNA and protein levels in experimental acute
cerebral ischemia. Although many cells died, the remaining
neurons had markedly higher expression levels of TREK
channels 24 h after MCAO surgery, indicating that more
functional proteins had been synthesized in the living
neurons. This might be the reason why they survived.
So, it seems that in the short period after cerebral
ischemia, the TREK channels were activated by the
pathological changes in the microenvironment, but over a
relatively long period, their genes were activated and
more functional proteins were expressed.

In physiological conditions, TREK-1, TREK-2, and
TRAAK can produce a leak current that determines the
resting potential, and thus influences cell excitability.
Usually, when potassium channels are activated, the
efflux of K+ from cells can lead to membrane
hyperpolarization, which can decrease cell excitability. In the
case of ischemia, the decrease of cell excitability has a
protective effect by reducing Ca2+ influx and release of
excitatory neurotransmitters. This is the same protective
mechanism as that of ATP-sensitive potassium channels
(KATP), which can be activated in the same condition
and act as a protective agent by counterbalancing the
depolarization induced by glutamate,
5-hydroxytryptamine (5-HT), etc. [15]. So, the activation of TREK genes
and enhanced expression of functional channel proteins
following MCAO can counteract the excitotoxicity
induced by excitatory neurotransmitters such as
glutamate and 5-HT. In this way, it seems that the
activation of TREK-1, TREK-2, and TRAAK in cerebral
ischemia provides a neuroprotective effect. This is
supported by other studies showing that
neuroprotective agents such as riluzole and some volatile
anesthetics can also activate TREK-1, TREK-2,
and TRAAK [16,17].
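
The link between K+ efflux and hyperpolarization can be illustrated with the Nernst equation, E_K = (RT/zF) ln([K+]out/[K+]in). The sketch below uses typical textbook concentrations (5 mM outside, 140 mM inside) and 37 °C; these values are assumptions for illustration and were not measured in this study.

```python
# Sketch: Nernst equilibrium potential for K+.
# Concentrations and temperature are textbook-style assumptions,
# NOT values reported in the paper.
import math

GAS_CONSTANT = 8.314462618   # J/(mol*K)
FARADAY = 96485.33212        # C/mol

def nernst_mv(conc_out_mm, conc_in_mm, valence=1, temp_c=37.0):
    """Equilibrium potential in millivolts for an ion of given valence."""
    temp_k = temp_c + 273.15
    volts = (GAS_CONSTANT * temp_k) / (valence * FARADAY) \
        * math.log(conc_out_mm / conc_in_mm)
    return 1000.0 * volts

e_k = nernst_mv(conc_out_mm=5.0, conc_in_mm=140.0)
print(f"E_K is about {e_k:.0f} mV")
```

With these assumed concentrations E_K comes out near -89 mV, well below a typical neuronal resting potential, which is why opening additional K+ leak channels pulls the membrane toward more negative values.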

It has also been reported that TREK-2 can be regulated
by Gs and Gi, which are linked to 5-HT and glutamate
receptors [18]. This indicates that its activity is
closely related to these excitatory neurotransmitters
and that it may be involved in tuning neuronal excitability
in response to a variety of neurotransmitters and
hormones.

But some scholars hold different opinions. In one
study, the neuroprotective agent sipatrigine was shown
to be a potent inhibitor of TREK-1, TREK-2,
and TRAAK activity. However, no evidence
showed that sipatrigine provides its protective effect by
inhibiting TREK-1, TREK-2, and TRAAK, and this
might only be a side-effect [19]. More recently, PO2 or
O2 concentration has been proposed to have an inhibitory
effect on TREK-1 activity, thus casting doubt
on the role TREK-1 may play in neuroprotection in
cerebral ischemia [20,21]. Because of the lack of specific
antagonists, there is still much work to do to fully
resolve the question.

Cerebral ischemia is a complex pathological process.
There are still uncertainties as to the protective
mechanisms, which may involve a cascade of pathological
molecular events. Our study provides further direct
evidence for the correlation between TREK channels
and cerebral ischemia. TREK-1, TREK-2, and TRAAK
might provide an important neuroprotective mechanism
in the pathological process following acute cerebral
ischemia.
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Gene expression signatures and biomarkers of noninvasive and invasive 
breast cancer cells: comprehensive profiles by representational difference 
analysis, microarrays and proteomics 
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We have characterized comprehensive transcript and 
proteomic profiles of cell lines corresponding to normal 
breast (MCF10A), noninvasive breast cancer (MCF7) and 
invasive breast cancer (MDA-MB-231). The transcript 
profiles were first analysed by a modified protocol for 
representational difference analysis (RDA) of cDNAs 
between MCF7 and MDA-MB-231 cells. The majority of 
genes identified by RDA showed nearly complete con- 
cordance with microarray results, and also led to the 
identification of some differentially expressed genes such 
as lysyl oxidase, copper transporter ATP7A, EphB6, 
RUNX2 and a variant of RUNX2. The altered transcripts 
identified by microarray analysis were involved in cell-cell 
or cell-matrix interaction, Rho signaling, calcium home- 
ostasis and copper-binding/sensitive activities. A set of 
nine genes that included GPCR11, cadherin 11, annexin
A1, vimentin, lactate dehydrogenase B (upregulated in
MDA-MB-231) and GREB1, S100A8, amyloid β precursor
protein, claudin 3 and cadherin 1 (downregulated in
MDA-MB-231) were sufficient to distinguish MDA-MB- 
231 from MCF7 cells. The downregulation of a set of 
transcripts for proteins involved in cell-cell interaction 
indicated these transcripts as potential markers for 
invasiveness that can be detected by methylation-specific 
PCR. The proteomic profiles indicated altered abundance 
of fewer proteins as compared to transcript profiles. 
Antisense knockdown of selected transcripts led to 
inhibition of cell proliferation that was accompanied by 
altered proteomic profiles. The proteomic profiles of 
antisense transfectants suggest the involvement of
peptidyl-prolyl isomerase, Raf kinase inhibitor and 80 kDa
protein kinase C substrate in mediating the inhibition of
cell proliferation.
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Introduction 

The transformation of a normal cell into a cancer cell
has been correlated to altered expression of a variety of
genes (Perou et al., 2000; Becker et al., 2005). The
expression of some of these genes is a direct result of
sequence mutation, whereas other changes occur due to
alterations in gene products that participate in specific
pathways. The changes in gene expression have been
routinely characterized by classical subtraction
hybridization and differential display approaches (Cerosaletti
et al., 1995; Alpan et al., 1996). With the availability
of the human genome sequence and sequences for a
number of other model organisms, traditional methods
have largely been replaced by gene microarrays
(Khan et al., 2001). These analyses have been used to
characterize the molecular basis of a variety of diseases
including cancer. A comprehensive analysis of a large
number of cancer cell lines allowed clustering of genes
into groups based on their expression patterns in
phenotypically related cell lines (Khan et al., 2001;
Dan et al., 2002; Rosenwald et al., 2002; van't Veer
et al., 2002). The results of profiling experiments
indicated expression of specific gene clusters in cell lines
that have the same origin or have arisen from the same
organ (Ross et al., 2000). A complementary approach
that has been used in limited ways is proteomics.
Proteomics scores for changes in different proteins
and peptides in cells with characteristic
phenotypic differences. However, a comparative analysis of
transcripts and proteins to establish a relationship
between transcript changes and protein levels has not
yet become routine.

Although expression profiling of tumor tissue and its 
comparison with normal tissue, in principle, is most 
appropriate to obtain the genetic signatures of a tumor 
type, such comparisons have not been free of attendant
complications. These complications arise due to
heterogeneity of tumor specimens wherein any cell type-specific
changes are likely to be masked by other cell
types that constitute the tumor specimen. For this 
reason, well-characterized cell lines established 
from tumor tissue may prove more informative and 
have been considered useful by cancer researchers. 
Comparing gene profiles between cell lines has the 
potential to reveal genes that could be causative for 
the phenotype and other genes that can serve as tumor 
biomarkers. 

Our investigations are aimed at designating a subset 
of transcripts that could distinguish a normal breast cell 
from a breast cancer cell and help to predict tumorigenic 
or metastatic potential of a transformed cell. We 
describe here transcript and proteomic profiles of a 
normal breast cell line, a tumorigenic but noninvasive 
breast carcinoma cell line and an invasive breast 
carcinoma cell line, and summarize them as a set of 
candidate biomarkers or targets for therapeutic inter- 
vention. The comparison of transcript profiles with 
proteomic profiles demonstrated that altered proteins 
were not always represented in the microarray desig- 
nated profiles and vice versa. Furthermore, we have 
targeted five transcripts that were upregulated in MCF7 
cells for investigating their role in cell proliferation 
pathways. The proteomic profiles have revealed that 
inhibition of cell proliferation by antisense knockdown 
was mediated by a specific set of proteins. 



Results 

Representational difference analysis 
As described in the Materials and methods section,
RDA was performed by using cDNAs from MCF7 and
MDA-MB-231 as tester/driver or driver/tester
combinations. The difference product in the first case represents
the genes that are either upregulated in or specific to
MCF7. On the other hand, the difference product of
MDA-MB-231 (tester) and MCF7 (driver) hybridization
resulted in the isolation of cDNAs that are either
upregulated in or specific to MDA-MB-231. The initial
linkers used in this protocol had internal BglII sites.
One strand of the linker was used to amplify both the
tester and driver cDNAs after linkers had been ligated
to cDNAs. After removal of linkers from amplified
cDNAs by digestion with BglII, a dephosphorylated
BglII adaptor was ligated to tester DNA. The BglII
adaptor had an internal EcoRI site. The difference
product was digested with EcoRI and cloned in
pBluescript vector. The cloning efficiency of the difference
product was very low (5 × 10⁴ c.f.u./µg of DNA). The
low efficiency of cloning is attributed to a substantial
fraction of amplified DNA product that is refractory to
restriction digestion. The sequencing of a set of 100
clones each from the difference libraries revealed 50
different kinds of clones. The majority of these
sequences were short fragments and represented either
3' regions or internal fragments of transcripts. A
summary of these clones is presented in Table 1. The
involvement of the majority of these cDNAs is well
characterized either in tumorigenesis or in metastasis.
The phenotypic characteristics of MCF-7 and
MDA-MB-231 ideally match with the biological significance of
these genes. The alterations in transcripts for Rho
signaling proteins, Ca2+-binding/requiring proteins,
tight junctions/anchoring junctions/gap junctions,
copper-binding or copper-sensitive proteins, and RUNX2 are
particularly noteworthy.

The differential expression of a representative number
of RDA clones was validated by semiquantitative PCR.
As shown in Figure 1, these transcripts were either
specific to or upregulated in the cell line that was used as
a tester. Such analyses demonstrated that more than
90% of the clones were differentially expressed. The
abundance of transcripts and the results of RT-PCR
were also confirmed by Northern blotting (Figure 2).
The pattern of hybridization clearly indicates that all
these transcripts showed differential expression in
MCF7 and MDA-MB-231 cells that were used as
driver/tester combinations for the RDA.

Gene microarrays

After obtaining preliminary molecular signatures of
MCF7 and MDA-MB-231 cells by RDA, we used
Affymetrix gene microarrays to expand the above
analysis to identify a comprehensive set of transcripts
that is deregulated in invasive breast carcinoma cells.
The comparisons of cell lines on the basis of transcripts
that are either present or absent, as shown in Figure 3,
revealed that a set of 123 genes distinguishes
MDA-MB-231 cells from MCF7 and MCF10A. These genes can be
classified by their involvement in functional classes such
as transcription, signal transduction, cell adhesion, cell
cycle, metabolism, transport, response genes and
development (Figure 4). The majority of these genes
participated in the process of signal transduction
followed by transcription, cell adhesion and
metabolism, respectively. A few transcripts in these classes were
tested by real-time RT-PCR to confirm their altered
abundance. The selected transcripts showed changes
ranging between two- and 10-fold, 11- and 20-fold and
greater than 20-fold, and were in close agreement with
the results of microarray analysis. The qualitative
pattern of change observed in microarray analysis
was readily reproduced by real-time or semiquantitative
RT-PCR for all transcripts tested.

The number of altered transcripts was over 1000
based on a change of twofold or greater, and a majority
of these genes show changes varying between two- and
fourfold (Figure 5). Interestingly, with all comparisons
combined, there were 21 genes downregulated more
than 50-fold and 55 genes that were upregulated more
than 18-fold when specific cell line pairs were compared
(Figure 5). The transcripts that represent the extremes of
the upregulated and downregulated scale can allow
distinction between MCF7 and MDA-MB-231 cells. These
transcripts include GPCR11, cadherin 11, annexin
A1, vimentin, lactate dehydrogenase B (upregulated
in MDA-MB-231) and GREB1, S100A8, amyloid
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Table 1 Differentially expressed transcripts identified by RDA 



Upregulated in MDA-MB-231

Extracellular matrix/matrix-crosslinking proteins
LOX
Laminin β1
Collagen VI-α1

Calcium-binding proteins
Reticulocalbin 1
S100A8
Cullin 5

Transcription factors/promoter-binding proteins
RUNX2
c-Jun
Fra-1

Cell-cell adhesion/cell-surface receptor proteins
Cdh11
CYR61
MHC class II antigen γ chain
Protease-activated receptor-1
Protease-activated receptor-4

ATPase/GTPase and signal transduction proteins
ATP7a
Caveolin 2
AXL receptor tyrosine kinase
Rho GEF 3
Rho/Rac GEF 18
P21-Rac2

Metalloproteases and MMP inhibitor proteins
TIMP-2
MT1-MMP

Stress-response proteins
Dual specificity phosphatase (DUSP)

Cytoskeletal component and binding proteins
Moesin
Vimentin
Filamin B

Cell-cycle regulation and growth/differentiation/apoptosis proteins
Cyclin B1
Cyclin E
Cyclin A2
Bcl2-like 1 protein

Secreted proteins and growth factors
Milk fat globule protein
TGF-α
SMAD-specific E3 ub ligase 2

Miscellaneous
Prion protein


Downregulated in MDA-MB-231

Calcium-binding proteins
Calgranulin B

Transcription factors/promoter-binding proteins
Chromosome 4 ORF
Estrogen receptor 1
RUNX2 variant (exon 8 deleted)

Cell-cell adhesion/cell-surface receptor proteins
Claudin 3
Amyloid β precursor protein
Triose phosphate isomerase
Plakoglobin
Cdh1
Cdh3
Annexin A9
RAR-α
Connexin 31

ATPase/GTPase and signal transduction proteins
RhoD
RhoB
TGF-β R1

Stress-response proteins
Protein kinase H11

Cytoskeletal component and binding proteins
Keratin 18
Tubulin δ1
Microtubule-associated protein τ

Cell-cycle regulation and growth/differentiation/apoptosis proteins
S100A13
S100C
Aurora kinase AIK2
Nucleoside diphosphate kinase

Secreted proteins and growth factors
Trefoil factor 3 (TFF3)
Trefoil factor 1 (TFF1)
Four and a half LIM domain 1
Solute carrier family 16 SLC16A6

DNA replication
DNA replication complex GINS-PSF2

Miscellaneous
Serine protease inhibitor type 1 (SPINT1)
Human homolog of Xenopus protein XAG
Hypothetical protein FLJ22222
Hypothetical protein 20171
Hypothetical protein MGC3265



β precursor protein, claudin 3 and cadherin 1
(downregulated in MDA-MB-231). The distinction between
MCF7 and MCF10A may be made based on keratin 19,
serine protease, amyloid β precursor, neuropeptide Y
receptor Y1 (upregulated in MCF7) and caldesmon,
annexin A1, epithelial membrane protein 1, S100A2,
keratin 15 (downregulated in MCF7). Likewise,
MDA-MB-231 cells differ from MCF10A in vimentin,
epithelial membrane protein 3, cadherin 11, GPCR
116, collagen type XIII α1, Bcl2-associated athanogene
2 (upregulated in MDA-MB-231) and keratin 15,
cystatin A, cadherin 1, CD24, calcium-activated
chloride channel, S100P, GPCR 87 (downregulated in
MDA-MB-231). Thus, a small subset of transcripts
may serve as an accurate signature of these cell lines.
Several of these gene products have been shown to
participate in tumorigenesis and invasiveness of breast
carcinoma cell lines.

The invasiveness phenotype of MDA-MB-231 cells
specifically relates to changes in the following functional
classes: (a) cell adhesion molecules, (b) Ca2+-requiring,
Ca2+-binding or Ca2+-regulatory genes, (c)
copper-sensitive or copper-transporting proteins and (d) specific
regulatory proteins of Rho signaling. Among these
functional groups, 23 transcripts involved in cell-cell
or cell-matrix interactions were underexpressed in
MDA-MB-231 cells and 21 transcripts were overexpressed in
this cell line (Table 2). The comparison of
Ca2+-requiring/binding genes indicated downregulation of
26 transcripts and upregulation of 26 transcripts in
invasive cells as compared to noninvasive cells (Table 3).
While Ca2+ homeostasis is extensively investigated in
human cancers, copper homeostasis is an underexplored
area. The alterations in copper homeostasis in breast
carcinoma cells were reflected by changes in
transcripts corresponding to a variety of copper-binding or
copper-sensitive proteins/enzymes (Tables 1 and 4). The
deregulation of Rho signaling was evident from changes
in various proteins involved in this pathway (Table 4).



Figure 1 Semiquantitative evaluation of selected transcripts. (a)
RNA was isolated from confluent culture dishes containing
MCF10A (lanes 1 and 4), MCF7 (lanes 2 and 5) or
MDA-MB-231 (lanes 3 and 6) cells. The amount of RNA was first determined
spectrophotometrically. Equal amounts of RNA, as determined by
absorbance at 260 nm, were amplified with primers specific to actin
gene for 18 cycles (lanes 1-3) or 20 cycles (lanes 4-6). The lane
containing size markers is labeled as M. (b) A set of primers
corresponding to caveolin 2 (lanes 1 and 2), TGF-α (lanes 3 and 4),
Moesin (lanes 5 and 6), LOX (lanes 7 and 8), Axl receptor (lanes 9
and 10), RhoD (lanes 11 and 12), S100A13 (lanes 13 and 14), TFF3
(lanes 15 and 16) and Claudin 3 (lanes 17 and 18) were amplified
for 32 cycles. Lanes 1, 3, 5, 7, 9, 12, 14, 16 and 18 represent
amplified products from MCF7 and lanes 2, 4, 6, 8, 10, 11, 13, 15
and 17 represent MDA-MB-231 cells. The lanes containing PCR
products from MDA-MB-231 cells are marked with an asterisk.








Figure 2 Analysis of selected transcripts by Northern
hybridization. The probes specific to CD74 (a), CYR61 (b), SPINT1 (c) and
DUSP (d) were labeled with α-32P nucleotide and hybridized to
blots containing RNA from MCF10A (lane 1), MCF7 (lane 2)
and MDA-MB-231 (lane 3). The blots were washed stringently and
developed as described. The amounts of RNA loaded were
normalized as in Figure 1.
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Figure 3 Comparison of cell lines based on the presence or absence of transcripts. The absence or presence of a transcript in the 
Affymetrix chip was scored by the fluorescence read-out as described in the Materials and methods section.
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Figure 4 Functional classes of transcripts that differentiate a cell line pair. The transcripts identified as present or absent were 
classified based on their functional importance. 
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Figure 5 Distribution of altered number of transcripts as a 
function of fold change. The altered transcripts were categorized in 
groups based on the magnitude of change in their abundance. 



Proteome analysis 

To identify altered abundance of proteins and relate it to 
transcript profiles, we characterized the protein profiles 
of MCF-10A, MCF-7 and MDA-MB-231 cells. Typically, 
>300 protein spots could be visualized in silver-stained 
gels, and there were far fewer protein spots 



Table 2 Cell adhesion molecules altered in breast carcinoma cells 

Upregulated in MDA-MB-231         Downregulated in MDA-MB-231 

Cadherin 4                        Claudin 3 
Cadherin 11                       Cadherin-1 
Catenin                           Cadherin-3 
Integrin α6                       Cadherin-18 
Transmembrane anchor protein      Cadherin, LAG seven pass receptor 
Eph B2                            Down syndrome cell adhesion 
Dystonin                          Catenin-δ2 
Laminin β1                        Eph A4 
Lamin                             Ephrin A4 
Filamin B                         Annexin A9 
Filamin C                         Ankyrin 3 
Talin 1                           Sarcoglycan 
Butyrophilin                      Keratin 8 
Spectrin-α                        MAP-7 
Spectrin-β                        MAP-τ 
Thrombospondin                    Plakoglobin 
Plastin 3                         Plakophilin 
Adducin 3γ                        Discoidin domain receptor 
Lamin B receptor                  Zona occludens 3 
Laminin 2                         Periplakin 
Lamin A/C                         Protocadherin α9 
                                  Laminin γ2 
                                  Laminin α3 


in gels that were stained with Coomassie blue. The 
comparison of MCF-7 or MDA-MB-231 proteins with 
MCF-10A revealed that MCF-7 had 11 unique protein 
spots, while MDA-MB-231 had 15 spots that were not 
seen in MCF-10A. These proteins were either specific to 
or upregulated in these cell lines. The identity of these 
proteins is shown in Table 5. Out of these 26 protein 
spots, only 25 yielded amino-acid sequence. As shown in 
the table, the list includes proteins involved in stress 



response, protein-tagging activities, calcium-binding and 
calcium homeostasis proteins and some regulatory 
proteins. Prominent among these changes were proteins 
involved in calcium homeostasis such as crocalbin, 
calreticulin, calcyclin and reticulocalbin. The changes in 
signaling pathways between the two cell lines were 
indicated by altered levels of Rho GDP dissociation 
inhibitor 1, an apoptosis/differentiation-regulating 
protein galectin, Myc expression regulator far 
upstream binding protein-1 and the microtubule regulator 
protein stathmin. The translation initiation 
factors 5A and 4H were also selectively upregulated in 
MDA-MB-231 cells. 
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Table 3 Calcium binding or sensitive transcripts 
Upregulated in MDA-MB-231                 Downregulated in MDA-MB-231 

Reticulocalbin                            S100A8 
Dystonin                                  S100A7 
Follistatin like 1                        S100A13 
Cullin                                    Tumor associated Ca2+ signal transducer 
Annexin A5                                Notch homolog 3 
EF hand domain containing-2               PKD2 
Hippocalcin like-2                        Adenylate cyclase 
LDLR                                      Phospholipase-C 
Steroid sulfatase                         Chemokine ligand 12 
MT-actin crosslinking factor              Ubiquitin-specific protease 
Inositol 1,4,5-triphosphate receptor 3    PKC-* 
Sorcin                                    Ret-protooncogene 
Guanine nucleotide-binding protein-γ      Signal peptide-CUB domain 
GAS6                                      Mannosidase α 
SWP-70 protein                            Bradykinin receptor B2 
Jagged 1                                  Solute carrier family 24 
EGF-containing fibulin-like ECM           Ca2+ channel voltage-dependent β3 
Transglutaminase 2                        Regulator of G protein signaling 17 
Thrombin receptor-like 1                  Dystrobrevin-α 
Plastin 3                                 Synaptotagmin 1 
FYN oncogene                              Matrix gla protein 
PKC-α                                     EF hand domain family member D1 
Calmegin                                  PK-cyclic AMP dependent 
Calpain                                   ATPase-Ca2+ transporting 
HEG homolog                               Calmodulin 1 
Cyr61                                     CaM kinase 
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Table 5 Proteomic profiles of MCF7 and MDA-MB-231 cells as 
compared to MCF10A cells 



MCF7 cells 

Cell-cell adhesion/cell-surface receptor proteins 
  Triose-phosphate isomerase 
Stress-response proteins 
  Hsp27 
  Superoxide dismutase 
  Peroxiredoxin 2 
Cytoskeletal component/binding proteins 
  Stathmin 
Cell-cycle regulation and growth/differentiation/apoptosis proteins 
  Nucleoside diphosphate kinase A 
  S100C 
Secreted proteins and growth factors 
  Macrophage migration inhibitory protein 
Miscellaneous 
  Cyt c oxidase VIb 
  Peptidyl-prolyl cis-trans isomerase 
  Ubiquitin 

MDA-MB-231 cells 

Calcium-binding proteins 
  Calcyclin 
  Calreticulin 
  Crocalbin 
  Reticulocalbin 
Transcription factors/promoter-binding proteins 
  Far upstream element binding protein-1 
  Far upstream element binding protein-2 
Cell-cell adhesion/cell-surface receptor proteins 
  Galectin 
ATPase/GTPase/signal transduction/trafficking proteins 
  Rho GDP dissociation inhibitor 1 
Stress-response proteins 
  Hsp 70 
  Peroxiredoxin 2 
Cytoskeletal component/binding proteins 
  Stathmin 
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Table 4 Altered transcripts involved in copper homeostasis and Rho 
signaling 



Copper-binding proteins                   Rho signaling proteins 

LOX                                       Rho3 
LOX-1, LOX-2                              Rho/Rac GEF 18 
SCO cyt oxidase-deficient homolog 2       Rho GEF 12 
COX 17 homolog                            Ras related C3 botulinum toxin substrate 2 
Metallothionein 1E, 1F and 2a             Cdc 42 effector protein 3 
Ring finger protein 7                     Rho GEF 3 
Amiloride-binding protein 1               Rho GDP dissociation inhibitor β 
Neurotrypsin/motopsin                     RhoD 



Table 5 (continued) MDA-MB-231 cells 

Miscellaneous 
  Heterogeneous nuclear ribonucleoprotein H 
  eIF4H (translation) 
  eIF5A (translation) 

Changes in proliferation characteristics and protein 
profiles in response to transfection with antisense 
constructs of selected transcripts 

We had observed significant upregulation of transcripts 
for DNA replication complex protein GINS PSF2, 
trefoil factor 3, aurora kinase AIK2, protein kinase H11 
and secreted protein XAG in MCF7 cells. We reasoned 
that antisense knockdown of the above genes in MCF7 
cells might indicate pathways involved in tumorigenesis 
and invasiveness. 

MCF7 cells were transfected with empty vector 
pCDNA3.1 or antisense constructs of the above genes. 
A semiquantitative amplification of pCDNA marker 
gene by RT-PCR confirmed the presence of the 
transfected construct in a significant proportion of the 
cell population. The transfected cells also showed a 
decrease in the target transcripts as observed by RT-PCR. 
The effects of antisense transfections were scored 
by growth characteristics of the transfectants. The cell 
proliferation was reduced between 15 and 40% when 
antisense transfectants were compared to cells transfected 
with empty vector. 

In order to relate decreased proliferation of transfectants 
to altered proteins, proteomic profiles of transfectants 
were compared with vector controls. The 
comparison of protein profiles of cells transfected with 
empty vector or antisense construct revealed alterations 
in several proteins for each transfectant (Table 6). These 
proteins included stress-response proteins, calcium-regulating 
proteins, translation factors, ubiquitin, proteins 
of electron transport chain and oxidative phosphorylation, 
signaling proteins, cytokeratins, actin and 
actin regulating proteins and general regulatory factors. 
The number of altered proteins varied between 3 and 15 
for various transfections. Peptidyl prolyl cis-trans 
isomerase, calcium-regulating proteins, SOD, galectin, 
histidine triad protein and PKC substrate were prominent 
among altered proteins. We performed database 
searches to identify interactors for all proteins that were 
altered in transfected cells and identified nearly 350 
proteins (data not shown). A significant number of these 
interacting proteins are involved in transcriptional 
regulation. 
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Table 6 Altered proteins in MCF7 cells after antisense knockdown of specific transcripts 



After transfection with as-trefoil factor 3 (TFF3) 

Calcium-binding proteins 
  Calmodulin 
Cell-cell adhesion/cell-surface receptor proteins 
  Retinoic acid-binding protein II 
ATPase-GTPase/signal transduction/trafficking proteins 
  Raf kinase inhibitor 
Stress-response proteins 
  Hsp27 
  Peroxiredoxin 1 
Cytoskeletal component/binding proteins 
  Cofilin-nonmuscle isoform 
Cell-cycle regulation and growth/differentiation/apoptosis proteins 
  Chromobox protein homolog 1 
  Chromobox protein homolog 3 
  Translationally controlled tumor protein 
Miscellaneous 
  40S ribosomal protein S12 
  ATP synthase D chain 
  Cancer-associated Sm-like protein 
  Cyt c oxidase polypeptide Va 
  eIF5A (translation) 
  Ferritin heavy chain 
  His triad nucleotide-binding protein 
  Histone H2B.n 
  Peptidyl-prolyl isomerase 
  Proteasome subunit β-type 6 
  RNA-binding protein 8A 
  Thioredoxin 
  Ubiquitin crossreactive protein 

After transfection with as-aurora kinase AIK2 

Calcium-binding proteins 
  Calgranulin B 
  Calgranulin A 
Stress-response proteins 
  Hsp27 
  Superoxide dismutase 
Miscellaneous 
  60S acidic ribosomal protein P2 
  Peptidyl-prolyl isomerase 
  Ubiquitin-crossreactive protein 

After transfection with as-protein kinase H11 

Cell-cell adhesion/cell-surface receptor proteins 
  Galectin 
  Retinoic acid-binding protein II 
ATPase-GTPase/signal transduction/trafficking proteins 
  PKC substrate 
Stress-response proteins 
  Hsp27 
Cytoskeletal component/binding proteins 
  Actin-α, skeletal muscle 
  Cytokeratin 18 
Miscellaneous 
  3-OH-acyl CoA dehydrogenase II 
  ATP synthase A chain 
  Cyt c oxidase peptide Va 
  Enhancer of rudimentary homolog 
  Thioredoxin 
  Ubiquinol-cyt c reductase 
  No match 

After transfection with as-DNA replication complex GINS PSF2 

Calcium-binding proteins 
  Calreticulin 
ATPase-GTPase/signal transduction/trafficking proteins 
  14-3-3 n 
  PKC substrate 
Cytoskeletal component/binding proteins 
  Cytokeratin 1 
  Cytokeratin 18 
Cell-cycle regulation and growth/differentiation/apoptosis proteins 
  Chromobox protein homolog 1 
  Chromobox protein homolog 3 
  MAP/MT affinity regulator 
Miscellaneous 
  ATP synthase D chain 
  Peptidyl-prolyl isomerase 
  Ubiquinol-cyt c reductase 

After transfection with as-human homolog of XAG 

Calcium-binding proteins 
  Calreticulin 
Stress-response proteins 
  Hsp27 
Miscellaneous 
  ATP synthase A chain 
  Peptidyl-prolyl isomerase 
  Ubiquitin crossreactive protein 
  No match 



Discussion 

The results presented here validate the gene profiles 
obtained from different expression platforms ranging 
from subtractive hybridization to gene microarrays and 
proteomic analysis. The RDA protocol is powerful 
enough to yield important genes that show significant 
alterations in their expression between cell lines, and can 
lead to isolation of full-length cDNAs by using 
appropriate modifications (Baskaran et al., 1996; Jacob 
et al., 1997). The detection of RUNX2, variant of 
RUNX2, EphB6, prion protein, lysyl oxidase and a 
copper transporter ATP7A transcripts by RDA warrant 
specific mention. RUNX transcription factors bind 
specific motifs on target gene promoters and regulate 
gene expression leading to cell growth, proliferation and 



differentiation (Pratap et al., 2003). RUNX2 and its 
variant have differential repression activity toward the 
promoter of the cyclin-dependent kinase inhibitor 
(p21CIP1) (Westendorf et al., 2002). The loss of EphB6 
expression due to methylation of its promoter is related 
to invasiveness of MDA-MB-231 (Fox and Kandpal, 
2004; unpublished observations). Lysyl oxidase, a 
copper-sensitive enzyme, causes oxidative deamination 
of lysine and hydroxylysines of collagen to aldehyde 
forms to stabilize collagen fibrils (Siegel, 197?) that are 
found in invasive breast carcinoma cells (Akiri et al., 
2003). The activation of LOX is dependent on copper 
that is internalized and then transported to trans golgi 
network by copper transporter ATP7A (Pase et al., 
2004), a protein mutated in Menkes disease (Moller 
et al., 2005). Prion protein has also been reported as a 
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chaperone for copper (Jones et ai, 2004). Thus, EphB6 
can serve as a marker for invasiveness, and LOX and 
ATP7A may be exploited as relevant targets for 
therapeutic intervention. 

The downregulation of junctional proteins along with 
inactivation of TIMPs as shown here is in agreement 
with other reports describing their relationship with 
invasiveness of carcinoma cells (Johnson, 1991; Kousidou 
et al., 2004; Shao et al., 2005) and promoter 
methylation (Costa et al., 2004). As several transcripts 
coding for junctional proteins are downregulated in 
invasive cells, we postulate that methylation-specific 
PCR can be exploited to use these transcripts as 
biomarkers of tumor cells in general and invasiveness 
in particular. The changes in cell-cell interaction 
correlate to cell phenotypes because such interactions 
influence Rho/Ras signal transduction pathways and 
vice versa (Malliri and Collard, 2003; Nagaraja and 
Kandpal, 2004; Ridley, 2004), and lend credence to the 
significance of altered transcripts for Rho and Rho 
GEFs as presented here. 

Early changes in calcium homeostasis as measured by 
calcium excretion have been reported in breast cancer 
(Campbell et al., 1983), and altered calcium signaling has 
been shown in invasive lung carcinoma cells (Amuthan 
et al., 2002). Prominent among calcium-binding proteins 
are the S-100 proteins, a group of intracellular messengers 
that respond to transient changes in calcium concentration 
by binding to specific receptors (Marenholz et al., 
2004) and regulate cell growth, differentiation and 
motility, transcription and cell cycle. The S-100 proteins 
detected in the present study map to chromosome 1q21, 
a region of the genome that is frequently altered in human 
breast cancer cells (Bieche et al., 1995). Calcium ions act 
as a second messenger in specific signaling pathways in a 
variety of cancers (Missiaen et al., 2000) and are known 
to alter calcineurin to activate transcription factors such 
as NFATc (Luo et al., 1996). 

As dictated by post-transcriptional regulation, protein 
profiles showed far fewer changes as compared to 
transcript profiles, and the knockdown of five selected 
genes in MCF7 cells produced interesting changes in 
protein profiles. These genes, namely, XAG (secretory 
Xenopus laevis protein), trefoil factor 3, human aurora2 
kinase AIK2, protein kinase H11 and DNA replication 
complex GINS PSF2, have been shown to be estrogen 
responsive, oncogenic or involved in tumorigenesis (Yu 
et al., 2001; Fletcher et al., 2003; Katoh, 2003; Warner 
et al., 2003; Takayama et al., 2003). The antisense 
constructs of these genes appeared to work as siRNAs 
as suggested by the reduction in the transcript detected 
in RT-PCR of RNA isolated from the transfected cells. 



most likely mediated via its interaction with ATM 
protein. Raf kinase inhibitor (Keller et al., 2004) and 
ATM (Hall, 2005) have been conclusively linked to 
transformation of cells, and the activity of Pin1 has been 
related to p53-mediated signaling pathways (Mantovani 
et al., 2004; Berger et al., 2005). In this context, p53 
activation has also been hypothesized by Cu-SOD 
prion-like enzyme (Wiseman, 2005). Thus, alterations 
in copper homeostasis and p53-mediated signaling may 
be considered as a significant regulatory mechanism in 
tumorigenesis. 

In summary, we have presented here a set of 
candidate genes that can serve as biomarkers for 
tumorigenesis and invasiveness, and some of these 
markers may be used to develop DNA-based diagnostic 
tests. The alterations in transcripts for copper home- 
ostasis genes suggest copper chelation or inhibition of 
copper transporter ATP7A as potential targets for 
therapeutic application. The modulation of RUNX2 
splicing variants by chemicals that affect splicing 
machinery may also be explored as a therapeutic 
modality. The changes in EphB6 expression, if con- 
firmed in tumor specimens, may have prognostic 
significance. 



Materials and methods 

Breast cancer cell lines 

We used MCF-10A, a cell line established from normal breast, 
and two breast carcinoma cell lines MCF-7 and MDA-MB- 
231 that vary in their in vitro and in vivo invasiveness. All cells 
were cultured at 37°C/7% CO2. MCF-10A cells were grown in 
1:1 DMEM:F12 media (Gibco) with 5% horse serum (Gibco), 
20 mM HEPES, 10 ng/ml EGF (Invitrogen), 10 ml/l PenStrep- 
Glutamine (10000 U/ml penicillin, 10000 μg/ml streptomycin 
and 29.2 mg/ml L-glutamine), 10 μg/ml insulin (Invitrogen), 
0.1 μg/ml Cholera Toxin (Sigma) and 500 ng/ml hydrocortisone 
(Sigma). MCF-7 and MDA-MB-231 cells were grown in 
DMEM (Gibco) supplemented with 2 mM L-glutamine (Gibco), 
1 mM sodium pyruvate (Gibco), 5 ml/l penstrep (5000 U/ml 
penicillin and 5000 μg/ml streptomycin), and 10% fetal 
bovine serum (Hyclone). 

Total RNA isolation 

RNA was isolated from 85 to 95% confluent 10 cm tissue 
culture dishes using TRI reagent (Molecular Research Center 
Inc.) with slight modifications to the recommended protocol. 
Approximately 10 million cells were mixed with 1.0 ml Tri 
Reagent, the mixture was extracted with chloroform and the 
aqueous phase containing RNA was separated. The RNA was 
precipitated with isopropanol, the pellet washed sequentially 
with 80 and 100% ethanol, then dried and resuspended in 
DEPC-treated water. RNA was stored in aliquots at -70°C. 
The quality of RNA was assessed by running it on a 
formaldehyde gel. The appearance of ribosomal RNA bands 
indicated that RNA was not degraded during the procedure. 
The amount of RNA was determined by its absorbance at 
260 nm. 

DNAase treatment of total RNA 

To remove DNA contamination, 20 μg of RNA (quantified 
spectrophotometrically) was treated with 500 ng DNAase I, 



The involvement of the above transcripts in invasive 
potential is apparent from the observed upregulation of 
calcium-binding proteins in transfected MCF7 cells, 
which is comparable to their levels in MDA-MB-231 
cells. The proteins that appear to mediate inhibition of 
proliferation in antisense-transfected cells include PKC 
substrate, Raf kinase inhibitor, histidine triad nucleotide-binding 
protein and peptidyl-prolyl isomerase 
(Pin1). We believe histidine triad protein effects are 



Oncogene 






Molecular signatures of breast carcinoma cells 
GM Nagaraja et al 






80 U RNasin (Promega) and 1 mM MgCl2 in Tris buffer in a 
total volume of 50 μl. The reaction was carried out at 37°C for 
1 h and the DNAase inactivated by heating to 65°C for 30 min. 

Representational difference analysis (RDA) 
RDA of cDNAs is a modification of genomic RDA (Lisitsyn 
et al., 1993). We performed RDA in the following two ways. In 
one experiment, MCF-7 cDNA was used as a driver and 
MDA-MB-231 cDNA as a tester. In the second experiment, 
MCF-7 cDNA was used as a tester and MDA-MB-231 cDNA 
as a driver. The protocol has been described previously (Jacob 
et al., 1997). Briefly, first-strand synthesis was carried out using 
a commercial cDNA synthesis kit as per the manufacturer's 
protocol. A linker with a BglII site was ligated to the tester as 
well as the driver cDNA. A primer specific to one of the linker 
strands was used to PCR amplify the linker-ligated cDNAs. 
The linkers were then removed by digesting the cDNA with 
BglII and the digested cDNA was gel purified. A second set of 
unphosphorylated BglII adaptors was ligated to the tester 
cDNA only. The tester and driver DNAs were hybridized in a 
5 μl reaction volume at a ratio of 1:40. After hybridization, the 
ends of the tester homoduplexes were repaired with Klenow 
polymerase and 1 μl of the reaction mixture was diluted to 
100 μl. The difference product was obtained by amplifying 1 μl 
of the diluted mixture using the top strand of the ligated 
adaptor as a primer. The amplified difference product was 
digested with EcoRI and cloned in a pBlueScript vector. 
Individual clones were picked and sequenced by Sanger's 
dideoxy chain termination method. Representative clones were 
validated by Northern analysis and semiquantitative RT-PCR. 

Microarray analysis 

The GeneChips, Human Genome U133A 2.0, (Affymetrix, 
Santa Clara, CA, USA) used in this study contained 
approximately 22000 probe sets corresponding to 18 400 
transcripts and variants, including 14500 well-characterized 
human genes. 

Total RNA was converted into double-stranded cDNA by 
using Superscript II (Invitrogen, Carlsbad, CA, USA) and an 
oligo-dT primer containing a heel of the T7 RNA polymerase 
promoter sequence. The reaction mixture containing double- 
stranded cDNA was extracted with phenol-chloroform, 
precipitated with ethanol, and dissolved in 12 μl RNase-free 
water. The cDNA was transcribed in vitro by using an RNA 
transcription labeling kit (Enzo Biochem, Farmingdale, NY, 
USA) with 6 μl of double-stranded cDNA in the presence of 
ATP, CTP, GTP, UTP, bio-11-CTP and bio-16-UTP. The 
biotinylated RNA was purified by using an affinity column 
(Qiagen, Valencia, CA, USA) and fragmented randomly, by 
heating to 95°C in the presence of fragmentation buffer, 
between sizes of 35 and 200 bases. The GeneChips were 
hybridized overnight at 45°C in a hybridization oven in a 
solution containing fragmented cRNA, control oligonucleotide 
B2, 20x eucaryotic hybridization controls, herring sperm 
DNA, acetylated BSA and 2x hybridization buffer. The 
GeneChips were washed and stained with streptavidin 



MAS5 analysis revealed a positive quality report. Ratios of 
average signal intensity (log 2) were calculated for the probe 
sets between pairs of cell lines and then converted to an 
average fold change (AFC). Statistical validation was performed 
on probe sets as described (Yoshida et al., 2004). The 
statistical method used to assign P-values to the fold changes 
of gene responses is described by Yoshida et al. (2004) and is a 
two-step procedure based on the Benjamini and Yekutieli 
construction of false discovery rate confidence intervals 
(FDRCI) (Reiner et al., 2003). Functional annotation of 
proteins was assigned through Gene Ontology 
(http://www.geneontology.org) or Locuslink 
(http://www.ncbi.nlm.nih.gov/LocusLink) classifications 
obtained through appropriate public databases. 
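
The conversion from log2 ratios to fold changes, and a false discovery rate adjustment in the spirit of Benjamini and Yekutieli, can be illustrated with a minimal sketch. The function names are hypothetical, the signed fold-change convention is an assumption, and the adjustment shown is the plain Benjamini-Yekutieli p-value correction, not the two-step FDRCI procedure of Yoshida et al. (2004) used in the study.

```python
def log2_ratio_to_fold_change(log2_ratio):
    """Convert a log2 ratio into a signed fold change.

    Assumed convention: +1.0 -> 2.0 (twofold up), -1.0 -> -2.0 (twofold down).
    """
    fold = 2.0 ** abs(log2_ratio)
    return fold if log2_ratio >= 0 else -fold


def benjamini_yekutieli(p_values):
    """Return Benjamini-Yekutieli adjusted p-values in the input order."""
    m = len(p_values)
    # Harmonic correction factor that makes the procedure valid under dependence.
    c_m = sum(1.0 / k for k in range(1, m + 1))
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        value = min(1.0, p_values[i] * m * c_m / rank)
        running_min = min(running_min, value)
        adjusted[i] = running_min
    return adjusted


if __name__ == "__main__":
    # Illustrative values only, not data from the study.
    print(log2_ratio_to_fold_change(1.5))
    print(benjamini_yekutieli([0.01, 0.04, 0.03]))
```

A probe set would then be reported as altered only if its adjusted p-value falls below the chosen threshold.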

Quantitative RT-PCR 

RNA was reverse transcribed with Superscript II (Invitrogen, 
Carlsbad, CA, USA) RT by priming with oligo-dT. The 
primers specific to validated genes were synthesized from the 3' 
untranslated region using Primer 3 software. PCR reactions 
were then performed in triplicate in an I-cycler Thermocycler 
with optical module (BioRad, Hercules, CA, USA). The 
amplified products were quantified by reading fluorescence of 
SybrGreen I (Molecular Probes, Eugene, OR, USA). Average 
fold changes were calculated by differences in threshold cycles 
(Ct) between pairs of samples to be compared. HPRT gene was 
used as a control. 
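
As an illustration of how a fold change can be derived from threshold-cycle differences, the sketch below uses the common 2^-ΔΔCt calculation. This is an assumption: the text does not state the exact formula, and the calculation presumes that product doubles every cycle. The function name and the Ct values are hypothetical.

```python
from statistics import mean


def fold_change_ddct(target_ct_a, ref_ct_a, target_ct_b, ref_ct_b):
    """Relative abundance of a target transcript in sample A versus sample B.

    Each argument is a list of replicate Ct values. The reference gene
    (HPRT in the study) normalizes for input RNA. Assumes perfect doubling
    of product per cycle.
    """
    delta_ct_a = mean(target_ct_a) - mean(ref_ct_a)
    delta_ct_b = mean(target_ct_b) - mean(ref_ct_b)
    delta_delta_ct = delta_ct_a - delta_ct_b
    return 2.0 ** (-delta_delta_ct)


if __name__ == "__main__":
    # Hypothetical triplicate Ct values for illustration only.
    ratio = fold_change_ddct(
        target_ct_a=[22.1, 21.9, 22.0], ref_ct_a=[18.0, 18.1, 17.9],
        target_ct_b=[25.0, 25.2, 24.8], ref_ct_b=[18.0, 18.0, 18.0],
    )
    print(f"target is {ratio:.1f}-fold more abundant in sample A")
```

A lower Ct means the transcript crossed the fluorescence threshold earlier, so a negative ΔΔCt corresponds to higher abundance in sample A.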

Semiquantitative RT-PCR 

The spectrophotometrically determined concentration of RNA 
was confirmed by amplifying actin message at different cycles. 
The cycling conditions that yielded proportional increment of 
amplified product were used to normalize the RNA concentration. 
The normalized RNA was used as template to determine 
relative abundance of transcripts corresponding to clones 
identified by RDA experiments. The conditions were standar- 
dized in the range of cycles that yielded a PCR product for at 
least one of the pairs of compared RNAs. Such experiments 
dictated cycles between 30 and 35 to be appropriate to 
compare abundance of selected transcripts in MCF7 and 
MDA-MB-231 cells. 

Northern analysis 

The expression pattern of selected transcripts in cell lines was 
also analysed in Northern blots. RNA (20 μg), as determined 
by spectrophotometer and confirmed by actin amplification, 
was electrophoresed on a formaldehyde agarose gel. A RNA 
ladder was used as a size marker. The RNAs were transferred 
from the gel to a Hybond nylon membrane by capillary 
transfer. The RNA was fixed onto the membrane by 
irradiation in a Stratalinker. The blot was hybridized at 65°C 
for 12-15 h with a radioactive probe and the blot was 
subsequently washed with 0.1x SSC and 1% SDS at 65°C. 
The hybridized probe was detected by autoradiography. 



Transfection of MCF-7 cells with antisense constructs 

The genes selected on the basis of their upregulation were 
cloned in antisense orientation in pCDNA3.1 vector (Invitrogen). 
MCF-7 cells were grown to 70-80% confluence. 
Approximately [...] of DNA was transfected into 

cells by using Lipofectamine 2000. The transfected cells were 
grown in the presence of G418 (400 μg/ml). The transfectants 
were processed for protein isolation. A control set of cells was 
transfected with an empty pcDNA3.1. The proteins were 
analysed by two-dimensional electrophoresis, and altered 
bands were excised for mass spectrometry. 



phycoerythrin and the antibody in 2x MES stain buffer, 
acetylated BSA, and optically read at a resolution of 6 μm with 
an Affymetrix GeneChip scanner 3000. Affymetrix Microarray 
Suite 5.0 was used for initial data processing 
(generation of .CHP files). Normalization (quantile method) 
and calculation of signal intensities was performed with the 
software package RMA from the R project 
(http://www.r-project.org/). For every cell line, three replicates were 
performed with Affymetrix Gene Chips. The Gene Chip data 
were used for further calculations after the raw image and 
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Protein isolation 

The cultured cells were harvested by trypsinization and 
centrifuged at 220 g for 5 min at 4°C. The cell pellet was 
washed once with ice-cold 1x PBS. The proteins were isolated 
by using a commercial kit (BioRad, Hercules, CA, USA). 
Briefly, pelleted cells (0.05 ml) were mixed with 0.5 ml ice-cold 
CPEB solution containing protease inhibitors cocktail 
(Roche), vortexed and stored on ice for 30 min. The cell 
suspension was passed through a syringe needle (20 gauge) for 
10-20 strokes to ensure complete cell lysis. The cytoplasmic 
protein fraction was collected by centrifugation at 100 g for 
10 min at 4°C. The nuclear pellet was washed once again with 
0.5 ml CPEB solution. The nuclear pellet was resuspended in 
0.75 ml PSB buffer, vortexed briefly and centrifuged at 1000 g 
for 10 min at 4°C, and the supernatant containing nuclear 
protein was collected into a new tube. The samples were 
quantified using 2D Quant kit (Amersham Biosciences), 
aliquoted and stored at -80°C to prevent protein degradation. 
To reduce streaking, background staining and the other gel 
artefacts associated with substances contaminating 2D/IEF 
samples, the samples were cleaned with 2D Clean up kit (Bio 
Rad, Hercules, CA, USA) before running on the gel. 

Two-dimensional gel electrophoresis 

The protein mixtures were separated based on isoelectric 
points by using commercial pre-cast pH gradient gel strips 
according to the manufacturer's instructions. The protein 
sample (175 μg) in 185 μl of sample buffer (8 M urea, 2% 
CHAPS, 0.2% biolytes, 3/10 ampholytes, 65 mM DTT and 
0.002% bromophenol blue) was loaded in the sample loading 
trays at the end of 11 cm immobilized rehydrated strips (pH 3-10) 
(Bio Rad, Hercules, CA, USA). Following isoelectric 
focusing, proteins were reduced and alkylated by successive 
15 min treatments with equilibration buffer (6 M urea, 0.375 M 
Tris-HCl pH 8.8, 2% SDS, 20% glycerol, 2% DTT) and 2.5% 
(w/v) iodoacetamide, respectively. Proteins were then resolved 
in the second dimension on 8-16% gradient SDS-PAGE 
gel (Bio Rad, Hercules, CA, USA). The protein spots 
were visualized by staining with either silver stain or 
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Coomassie blue stain. The gel images were compared and 
bands showing significant (greater than twofold) alterations in 
intensity were excised and processed for mass spectrometry. 
Comparisons were made between protein lysates from MCF- 
10A, MCF-7 and MDA-MB-231 cell lines or between MCF-7 
and MCF-7 cells transfected with specific antisense constructs. 
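
The greater-than-twofold selection criterion described above can be sketched as a simple filter over matched spot intensities. The function name, the spot identifiers and the intensity values are hypothetical, and the sketch assumes intensities have already been normalized between gels; spots present in only one gel are skipped here and would need separate handling.

```python
def altered_spots(control, treated, threshold=2.0):
    """Return {spot_id: ratio} for spots changed by more than `threshold`-fold.

    `control` and `treated` map spot identifiers to normalized intensities.
    A spot is reported when treated/control exceeds the threshold or falls
    below its reciprocal.
    """
    altered = {}
    for spot_id in control.keys() & treated.keys():
        c, t = control[spot_id], treated[spot_id]
        if c <= 0 or t <= 0:
            continue  # unique or missing spots are not comparable as a ratio
        ratio = t / c
        if ratio > threshold or ratio < 1.0 / threshold:
            altered[spot_id] = ratio
    return altered


if __name__ == "__main__":
    # Illustrative intensities only, not data from the study.
    vector_only = {"spot1": 100.0, "spot2": 100.0, "spot3": 100.0}
    antisense = {"spot1": 250.0, "spot2": 150.0, "spot3": 40.0}
    for spot, ratio in sorted(altered_spots(vector_only, antisense).items()):
        print(spot, round(ratio, 2))
```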

Protein identification by enzymatic digestion followed by mass 
spectrometry 

Prior to performing trypsin digestion, the gel pieces containing 
protein spots were washed sequentially once with water and 
twice with acetonitrile. The gel pieces were then allowed to 
swell in 100 mM ammonium bicarbonate and finally washed 
with acetonitrile. The washed slices were dried in a Speed Vac 
concentrator, and subsequently incubated with 20 μl of 
Promega's autocatalysis-resistant trypsin (12.5 ng/μl in 50 mM 
ammonium bicarbonate and 5 mM CaCl2, pH 8.0) overnight at 
37°C. The supernatant (5 μl) from tryptic digest was injected 
for peptide sequence analysis using on-line capillary liquid 
chromatography-electrospray ionization-tandem mass spectrometry 
(LC-MS/MS). The front end HPLC utilized a Dionex 
(San Francisco, CA, USA) Vydac 300 μm inner diameter 
x 15 mm C18 column. The linear acetonitrile gradient (3%/ 
min, containing 0.02% TFA) was developed using a Hewlett- 
Packard 1100 pump operating at 0.1 ml/min, and the flow was 
split before the injector such that the flow rate through the 
column was 3 μl/min. Peptides were detected at 215 nm. The 
in-line mass spectrometer was a ThermoElectron LCQ-DECA 
instrument operated in data-dependent MS/MS mode, and 
proteins were identified by searching a nonredundant protein 
database using the Sequest program. 
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Identification of androgen-regulated genes in the 

prostate cancer cell line LNCaP by serial analysis 
of gene expression and proteomic analysis 

A common therapy for nonorgan-confined prostate cancer involves androgen deprivation. 
To develop a better understanding of the effect of androgen on prostatic cells, we 
have analyzed gene expression changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer line LNCaP, at both RNA and protein levels. 
Changes at the RNA level induced by DHT were determined by means of serial analysis 
of gene expression (SAGE), and protein profiling was done by means of quantitative 
two-dimensional polyacrylamide gel electrophoresis. Among 123371 transcripts analyzed, 
a total of 28844 distinct SAGE tags were identified representing 16570 genes. 
Some 351 genes were significantly affected by DHT treatment at the RNA level 
(p < 0.05), of which 147 were induced and 204 repressed by androgen. In two independent 
experiments, the integrated intensity of 32 protein spots increased and 12 
decreased at least two-fold in response to androgen, out of a total of 1031 protein 
spots analyzed. The change in intensity for most of the affected proteins identified 
could not be predicted based on the level of their corresponding RNA. Our study provides 
a global assessment of genes regulated by DHT and suggests a need for profiling 
at both RNA and protein levels for a comprehensive evaluation of patterns of gene 
expression. 



Keywords: Androgen / Prostate / LNCaP / Gene expression / Serial analysis of gene expression 
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1 Introduction 

Androgens affect numerous aspects of prostate biology 
including development, growth, and maintenance. They 
also play a critical role in tumorigenesis and progression 
of prostate cancer. Androgen deprivation is an estab- 
lished treatment modality for prostate cancer. However, 
the disease eventually progresses into a hormone refrac- 
tory cancer. Several mechanisms have been identified 
which may contribute to androgen independence: (1) 
Mutations in the androgen receptor (AR) lead to ligand- 
independent activation or promiscuity of the receptor 
[1, 2]. The ability of the receptor to activate or repress 
downstream genes can also be affected by mutation. AR 
mutation is associated with advanced phases of prostate 
cancer [3, 4]; (2) AR can be activated in a ligand-indepen- 
dent manner by specific growth factors and cytokines [5]; 
(3) AR gene amplification has been found to occur in 
28-30% of tumors that recurred post androgen-ablation
therapy [6, 7]. Wallen et al. [8] have recently shown that
one-third of locally recurrent hormone refractory prostate 
cancer contain AR gene amplification; (4) Coactivator 
amplification and corepressor down-regulation have 
been shown to increase receptor transactivation [9, 10]. 
Regardless of which pathway(s) is taken by the tumor 
cells, androgen-regulated genes may ultimately be the 
key players in the development of hormone refractory 
cancer. As a step towards a better understanding of the 
effect of androgen on gene expression, we have under- 
taken a comprehensive assessment of gene expression 
changes induced by dihydrotestosterone (DHT) in the
androgen responsive prostate cancer cell line LNCaP. An 
important feature of our study is the parallel assessment 
of expression changes at both RNA and protein levels. 



2 Materials and methods 

2.1 Prostate cell line and DHT treatment 

LNCaP human prostate cancer cell line (American Type Cul- 
ture Collection, Rockville, MD, USA) was cultured for 3 d in 
phenol-free RPMI 1640 supplemented with 5% charcoal-
stripped fetal bovine serum (FBS) at 37°C. Half of the cultures
were then treated with 10⁻⁹ M DHT for 24 h. Total RNA
and protein were extracted from untreated and DHT treated
cells using TRIzol (Invitrogen, Carlsbad, CA, USA) reagent
[...] of 9 M urea, β-mercaptoethanol and 1% NP-40. For kinetics
experiments 1 × 10⁶ cells were plated in phenol-free RPMI
medium with 5% charcoal-stripped FBS. On day three,
cells were treated with 5 mL of media containing 10⁻⁹ M
DHT and total RNA was extracted at different time points
from 0-96 h. To determine the direct/indirect effect of androgen
on RNA levels, cells were plated as above and treated
with 5 ng/mL of cyclohexamide in the presence or absence
of 10⁻⁹ M DHT for 24 h and total RNA was extracted.



2.2 Serial analysis of gene expression 

Serial analysis of gene expression (SAGE) was performed
as described previously [11] with the following modifications:
ditags were PCR amplified using biotinylated primers
and digested with NlaIII enzyme [12]. Concatemers
were heated for 15 min at 65°C and chilled on ice for
10 min before separation on an 8% polyacrylamide gel
[13]. The concatemers were then cloned into the SphI
site of the pZero vector (Invitrogen). Concatenated tags
were screened by PCR using M13 forward and M13
reverse primers. PCR products with inserts greater than
500 bp were isolated and sequenced with M13 forward
primer on an automated 3700 DNA sequencer (Perkin-Elmer,
Norwalk, CT, USA). For microSAGE, 1 µg of total
RNA per tube was used for cDNA synthesis in two tubes,
with the mRNA Capture Kit (Boehringer Mannheim,
Indianapolis, IN, USA). cDNAs were cleaved with NlaIII,
ligated to linkers and then digested with BsmFI enzyme.
The released tags were ligated and processed through the
remaining steps of the standard SAGE protocol.

2.3 SAGE data analysis 

SAGE tags were extracted using the SAGE software 
V 4.12 [11]. Tags were matched to the SAGE reliable 
map (release 10-26-2000) (http://www.ncbi.nlm.nih.gov/ 
SAGE/). Due to the fact that some tags map to multiple 
genes and some genes have multiple tags, SAGE data 
were analyzed in two different ways: (1) exclusion 
method: tags that match to multiple genes were dis- 
carded. Only tags that match to a single gene were tabu- 
lated and composite counts analyzed for their signifi- 
cance; (2) inclusion method: tags that match to multiple 
genes were counted at 100% toward each gene. All tags
were tabulated and composite counts analyzed for their 
statistical significance. Lists of differentially expressed 
genes (p<0.05) obtained from the exclusion and inclu- 
sion methods were compared, and finally only genes that 
have a p value < 0.05 in both lists were considered statistically
significant. The total number of genes identified
was estimated by $N_m + (N_u - 0.1 N_u)/3.5$, where $N_m$ is
the number of genes matched to SAGE tags, $N_u$ is the
number of SAGE tags that do not match to known genes
or ESTs, with 10% representing the estimated sequencing
error per SAGE tag and 3.5, the average number of tags
per gene in the SAGE reliable map (release 10-26-2000).
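
The gene-count estimate described above (genes matched to tags, plus the unmatched tags discounted by the 10% sequencing error and divided by 3.5 tags per gene) can be sketched as follows. This is only an illustration: the function name and the input values are hypothetical and are not taken from the study.

```python
def estimate_gene_count(matched_genes, unmatched_tags,
                        error_rate=0.10, tags_per_gene=3.5):
    """Estimate total genes as N_m + (N_u - error_rate * N_u) / tags_per_gene."""
    if matched_genes < 0 or unmatched_tags < 0:
        raise ValueError("counts must be non-negative")
    if tags_per_gene <= 0:
        raise ValueError("tags_per_gene must be positive")
    # Discount the unmatched tags by the assumed sequencing error rate,
    # then convert the remaining tags into an equivalent number of genes.
    usable_tags = unmatched_tags - error_rate * unmatched_tags
    return matched_genes + usable_tags / tags_per_gene


# Hypothetical inputs, chosen only to make the arithmetic easy to follow.
example = estimate_gene_count(matched_genes=10000, unmatched_tags=3500)
print(round(example))
```

The defaults mirror the two constants quoted in the text; both are exposed as parameters because they depend on the release of the SAGE map used.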



2.4 RT-PCR and real-time PCR quantification 

For reverse transcription-PCR (RT-PCR), 1 ng of total RNA 
was reverse transcribed into cDNA. One fortieth of cDNA 
was used for PCR reaction. Samples were collected at 
different cycles and separated on a 2% agarose gel with 
ethidium bromide. Image was captured and quantified 
using NucleoVision 760 Image Workstation (Nucleotech, 
CA, USA). Amplification curves were generated. Sub- 
sequently, RT-PCR was done at cycles within the log 
phase of amplification. 

Real-time quantification was performed in the iCycler (Bio-Rad,
Hercules, CA, USA). Briefly, one fortieth of cDNA was
used in each reaction. Six reactions were carried out for
each gene and three independent experiments were performed.
The PCR mix comprised 1× PCR buffer, 1.5 mM
MgCl2, 0.1 mM dNTP, 200 nM primers (listed below), 0.05 U
platinum Taq polymerase (Invitrogen) and 0.1× SYBR green
(Molecular Probes, Eugene, OR, USA). PCR was carried out
at 94°C for 2 min, and 40 cycles of 94°C for 30 s, 55°C for
30 s, and 72°C for 30 s. Primers used for real-time PCR
quantification: PSA (prostate specific antigen) (Hs.171995,
5'-GGAAATGACCAGGCCAAGAC-3', 5'-CAACCCTGGACCTCACACCTA-3'),
SCMH1 (Hs.57475, 5'-GCCTTGACCACATCACTCCAT-3',
5'-AGGCCTAGGGCTGCAAAAG-3'),
and clusterin (Hs.75106, 5'-GCAGGAATACCGCAAAAAGC-3',
5'-GACTCAAGATGCCCCCGTAAG-3'). Standard
samples (50, 25, 10, 5, 2.5, 1 and 0.5 µL cDNA) and experimental
samples were used in real-time quantification PCR.
Each sample was run in quadruplicate. A standard
curve ($Ct = mX + B$) was obtained, where $Ct$ is the threshold
cycle number, $X$ is the log quantity of target molecules, $m$ is
the curve slope and $B$ is the Y-axis intercept. The
fold induction or repression for a given target molecule
was calculated as $Q_a/Q_b$, where $Q_a$ is the target quantity
in experimental sample A and $Q_b$ is the target quantity
in experimental sample B. Since $Ct_a = m \log Q_a + B$, $Q_a =
10^{(Ct_a - B)/m}$; therefore, $Q_a/Q_b = 10^{(Ct_a - Ct_b)/m}$.
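
As an illustration of the standard-curve calculation above, the sketch below fits Ct against the log quantity of the standards and converts a Ct difference into a fold change. It assumes base-10 logarithms and an ordinary least-squares fit; the function names and the Ct values in the usage lines are hypothetical, not data from the study.

```python
import math


def fit_standard_curve(quantities, cts):
    """Least-squares fit of Ct = m * log10(Q) + B; returns (m, B)."""
    if len(quantities) != len(cts) or len(quantities) < 2:
        raise ValueError("need at least two paired standards")
    xs = [math.log10(q) for q in quantities]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(cts) / len(cts)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    m = sxy / sxx
    b = mean_y - m * mean_x
    return m, b


def fold_change(ct_a, ct_b, m):
    """Q_a / Q_b = 10 ** ((Ct_a - Ct_b) / m), from the fitted slope m."""
    return 10 ** ((ct_a - ct_b) / m)


# Standard dilution series from the text; the Ct values are made up
# so that they lie exactly on a line with slope -3.32 and intercept 30.
standards = [50, 25, 10, 5, 2.5, 1, 0.5]
standard_cts = [30 - 3.32 * math.log10(q) for q in standards]
slope, intercept = fit_standard_curve(standards, standard_cts)
print(fold_change(ct_a=25.0, ct_b=28.32, m=slope))
```

Because the slope is negative, a lower Ct in sample A than in sample B yields a ratio above one, which is read as induction in A relative to B.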



2.5 2-D PAGE 

The procedure followed was as previously described
[14]. Cells were solubilized in 200 µL of lysis buffer containing
9.5 M urea (Bio-Rad), 2% NP-40, 2% carrier






Proteomics 2001, 7, 1327-1338 

ampholytes pH 4-8 (Gallard/Schlesinger, Carle Place,
NY, USA), 2% β-mercaptoethanol and 10 mM PMSF.
Aliquots containing 5 × 10⁶ cells were
applied onto isofocusing gels. IEF was conducted using
pH 4 to 8 carrier ampholytes at 700 V for 16 h, followed
by 1000 V for an additional 2 h. The 1-D gel was loaded
onto the 2-D gel, after equilibration in 125 mM Tris, pH 6.8,
10% glycerol, 2% SDS, 1% DTT and bromophenol blue.
For the second dimension separation, a gradient of
11-14% of acrylamide (Serva, Hauppauge, NY, USA)
was used. Following 2-D PAGE, proteins were visualized
by silver staining of the gels or transferred to an Immobilon-P
PVDF membrane (Millipore, Bedford, MA).



2.6 Protein identification by mass spectrometry 

The 2-D gels were silver stained by successive incubations
in 0.02% sodium thiosulfate for 2 min, 0.1% silver
nitrate for 40 min and 0.014% formaldehyde plus 2%
sodium carbonate. The proteins of interest were excised
from the 2-D gels and destained for 5 min in 15 mM
potassium ferricyanide and 50 mM sodium thiosulfate
as described [15]. Following three washes with water,
the gel pieces were dehydrated in 100% acetonitrile for
5 min and dried for 30 min in a vacuum centrifuge.
Digestion was performed by addition of 100 ng of trypsin
(Promega, Madison, WI, USA) in 200 mM ammonium
bicarbonate or by addition of 100 ng of the endoproteinase
Glu-C (Promega) in 100 mM ammonium bicarbonate.
The Lys-C digestion was performed with 500 ng of
the endoproteinase Lys-C (Roche, Mannheim, Germany)
in 100 mM Tris-HCl, pH 9. Following enzymatic digestion
overnight at 37°C, the peptides were extracted twice
with 50 µL of 60% acetonitrile/1% TFA. After removal of
acetonitrile by centrifugation in a vacuum centrifuge, the
peptides were concentrated by using C18 pipette tips
(Millipore).

Analyses were performed primarily using a PerSeptive
Biosystems MALDI-TOF Voyager-DE mass spectrometer
(Framingham, MA, USA), operated in delayed extraction
mode. Peptide mixtures were analyzed using a saturated
solution of α-cyano-4-hydroxycinnamic acid (CHA) (Sigma,
St. Louis, MO, USA) in acetone containing 1% TFA
(Sigma). Peptides were selected in the mass range of
800-4000 Da. Spectra were calibrated using calibration
mixture 2 of the Sequazyme peptide mass standards kit
(PerSeptive Biosystems). The search program MS-Fit,
developed by the University of California at San Francisco
(http://prospector.ucsf.edu), was used for searches in the
NCBI database. Search parameters were as follows:
maximum allowed peptide mass error of 400 ppm, consideration
of one incomplete cleavage per peptide and
pH range between 4 and 8. MALDI-TOF mass spectrometry
was also used for molecular weight determination
as described [16]. In some cases, the amino acid
sequence of some peptides of interest was determined
by ESI MS analysis.
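
The two numeric search constraints quoted above (an 800-4000 Da mass window and a 400 ppm mass error limit) can be illustrated with a small sketch. This is not the MS-Fit implementation; the function names are hypothetical and the masses in the usage lines are invented.

```python
def ppm_error(observed_mass, theoretical_mass):
    """Signed mass error in parts per million."""
    return (observed_mass - theoretical_mass) / theoretical_mass * 1e6


def in_mass_window(mass, low=800.0, high=4000.0):
    """True if a peptide mass falls inside the selected range (Da)."""
    return low <= mass <= high


def matches(observed_mass, theoretical_mass, tol_ppm=400.0):
    """True if an observed peak is inside the window and within tolerance."""
    return (in_mass_window(observed_mass)
            and abs(ppm_error(observed_mass, theoretical_mass)) <= tol_ppm)


# Invented masses: a 0.3 Da offset at 1000 Da is 300 ppm (accepted),
# while a 0.5 Da offset is 500 ppm (rejected).
print(matches(1000.3, 1000.0), matches(1000.5, 1000.0))
```

A tolerance expressed in ppm scales with mass, so the same 400 ppm limit allows a larger absolute error for heavier peptides.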

3 Results 

3.1 SAGE analysis of the effect of androgen 
on gene expression 

SAGE libraries were generated from LNCaP cells cultured
in the presence or absence of DHT. A total of
123371 tags were generated, of which 62878 were from
the LNCaP cell line and 60493 from the LNCaP cells
treated with DHT for 24 h (Table 1). Sequence analysis
identified a total of 28844 distinct tags representing
16570 genes, 11243 from LNCaP and 12203 from DHT
treated cells. A total of 351 transcripts were differentially
expressed at a significant level (p<0.05). Eighty-seven
percent of transcripts matched GenBank entries; 79%
corresponded to known sequences and 8% to ESTs.
RNA levels for 147 genes were increased and 204 genes
were decreased after DHT treatment (Table 2). Therefore,
at the RNA level more genes were repressed than
activated by androgens. Of these androgen-regulated
genes, 149 were changed ≥ five-fold by DHT treatment
(Table 3).



Table 1. Summary of SAGE analysis in LNCaP cells

Sample   Total transcripts   Distinct tags   Number of genes
-DHT     62878               17050           11243
+DHT     60493               18510           12203
Total    123371              28844           16570

-DHT: LNCaP cells without dihydrotestosterone; +DHT: LNCaP cells treated with 10⁻⁹ M dihydrotestosterone for 24 h



Table 2. DHT regulated genes in LNCaP cells

Genes                         LNCaP vs LNCaP + DHT (gene #)
                              p < 0.05    p < 0.01
Known genes                   277         147
ESTs                          29          8
No match                      45          13
Total                         351         168
Up-regulated by DHT (24 h)    147         65
Down-regulated by DHT (24 h)  204         103

Table 3. List of candidate genes that are regulated by DHT (> 5 fold)

UGD or Tag  CAP  DHT  Change     P value   Description
283305      1    82   Induced    0         immunoglobulin heavy constant alpha 1
183752      0    42   Induced    0         microseminoprotein, beta-
140         1    44   Induced    0         immunoglobulin heavy constant gamma 3 (Gm marker)
75415       18   95   Induced    0         beta-2-microglobulin
84298       0    17   Induced    1.00E-05  CD74 antigen (invariant polypeptide of major histocompatibility complex, class II antigen-associated)
9615        0    16   Induced    1.00E-05  myosin regulatory light chain 2, smooth muscle isoform
77443       0    16   Induced    1.00E-05  actin, gamma 2, smooth muscle, enteric
1119        0    15   Induced    2.33E-05  nuclear receptor subfamily 4, group A, member 1
84753       23   2    Repressed  3.00E-05  KIAA0246 protein
75777       0    14   Induced    3.66E-05  transgelin
76325       0    14   Induced    3.66E-05  immunoglobulin J polypeptide, linker protein for immunoglobulin alpha and mu polypeptides
1852        1    17   Induced    4.33E-05  acid phosphatase, prostate
78465       0    13   Induced    9.00E-05  v-jun avian sarcoma virus 17 oncogene homolog
78344       1    16   Induced    1.20E-04  myosin, heavy polypeptide 11, smooth muscle
83006       19   2    Repressed  1.50E-04  CGI-139 protein
263812      15   1    Repressed  3.19E-04  nuclear distribution gene C (A. nidulans) homolog
TACGGGGATA  0    11   Induced    4.20E-04  Novel
GCCTGGGTGG  11   0    Repressed  6.63E-04  Novel
GACTGACACT  16   2    Repressed  8.40E-04  Novel
284296      0    10   Induced    8.60E-04  Homo sapiens SURF-4 mRNA, complete cds
75105       13   1    Repressed  1.19E-03  emopamil-binding protein (sterol isomerase)
128075      10   0    Repressed  1.30E-03  ESTs
154162      15   2    Repressed  1.43E-03  ADP-ribosylation factor-like 2
6895        1    12   Induced    1.49E-03  actin related protein 2/3 complex, subunit 3 (21 kD)
143240      17   3    Repressed  1.63E-03  ESTs
[illegible]  0    9    Induced    1.66E-03  myosin, light polypeptide kinase
180266      0    9    Induced    1.66E-03  tropomyosin 2 (beta)
93002       14   2    Repressed  2.53E-03  ubiquitin carrier protein E2-C
17883       14   2    Repressed  2.53E-03  protein phosphatase 1G (formerly 2C), magnesium-dependent, gamma isoform
126023      14   2    Repressed  2.53E-03  ESTs, Highly similar to NTC1_HUMAN NEUROGENIC LOCUS NOTCH PROTEIN HOMOLOG 1 PRECURSOR
69469       2    13   Induced    2.92E-03  dendritic cell protein
77899       2    13   Induced    2.92E-03  tropomyosin 1 (alpha)
285501      2    43   Induced    3.07E-03  Human rearranged immunoglobulin lambda light chain mRNA
119209      0    8    Induced    3.07E-03  insulin-like growth factor binding protein 7
173043      0    8    Induced    3.07E-03  metastasis-associated 1-like 1
75866       0    8    Induced    3.07E-03  dimethylarginine dimethylaminohydrolase 1
171695      0    8    Induced    3.07E-03  dual specificity phosphatase 1
TACGGGGATT  0    8    Induced    3.34E-03  Novel
172791      10   [illegible]  Repressed  [illegible]  ubiquitously-expressed transcript
13561       13   2    Repressed  4.19E-03  ESTs, Weakly similar to dJ37E16.5 [H. sapiens]
98260       8    0    Repressed  4.34E-03  ESTs
CATAAGACTT  8    0    Repressed  4.64E-03  Novel
TACGGGGACA  2    12   Induced    5.76E-03  Novel
180034      12   2    Repressed  6.91E-03  cleavage stimulation factor, 3' pre-RNA, subunit 3, 77kD
5753        12   2    Repressed  6.91E-03  inositol(myo)-1(4)-monophosphatase 2
278941      10   1    Repressed  6.94E-03  PRO0628 protein
19500       0    7    Induced    7.25E-03  nuclear localization signal deleted in velocardiofacial syndrome

Table 3. Continued

UGD or Tag  CAP  DHT  Change     P value   Description
[illegible]  0    7    Induced    7.25E-03  VAMP (vesicle-associated membrane protein)-associated protein A (33kD)
26471       0    7    Induced    7.25E-03  Homo sapiens clone HQ0692
AACTGCTGGC  12   2    Repressed  7.63E-03  Novel
GCTGACCGTC  7    0    Repressed  8.59E-03  Novel
CCCCCCTGTC  7    0    Repressed  8.59E-03  Novel
[illegible]  2    [illegible]  Induced    [illegible]  Homo sapiens cDNA [illegible]
[illegible]  1    9    Induced    [illegible]  actinin, alpha 1
[illegible]  1    9    [illegible]  [illegible]  proteasome (prosome, macropain) subunit, beta type, [illegible]
[illegible]  7    0    Repressed  9.61E-03  trophinin associated protein (tastin)
[illegible]  [illegible]  0    Repressed  9.61E-03  Homo sapiens mRNA for hypothetical protein ([illegible] gene)
[illegible]  7    0    Repressed  9.61E-03  gamma-glutamyl carboxylase
4877        7    0    Repressed  9.61E-03  CGI-51 protein
22795       7    0    Repressed  9.61E-03  ESTs
79335       7    0    Repressed  9.61E-03  Homo sapiens SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin D1 (SMARCD1) mRNA
[illegible]  9    1    Repressed  [illegible]  ADP-ribosylation factor 3
[illegible]  9    1    Repressed  [illegible]  Novel
182217      0    6    Induced    1.32E-02  succinate-CoA ligase, ADP-forming, beta subunit
75106       0    6    Induced    1.32E-02  clusterin (complement lysis inhibitor, SP-40,40, sulfated glycoprotein 2, testosterone-repressed prostate message 2)
256311      0    6    Induced    1.32E-02  granin-like neuroendocrine peptide precursor
103180      0    6    Induced    1.32E-02  DC2 protein
TACGGGGATG  1    8    Induced    1.70E-02  Novel
[illegible]  2    10   Induced    1.70E-02  Novel
[illegible]  2    10   Induced    1.71E-02  [illegible] box-containing [illegible] protein
[illegible]  2    10   Induced    1.71E-02  glioblastoma overexpressed
[illegible]  [illegible]  10   Induced    1.71E-02  ESTs, Weakly similar to similar to S. cerevisiae hypothetical protein [illegible]
[illegible]  6    0    Repressed  [illegible]  Novel
[illegible]  6    0    Repressed  1.75E-02  Novel
[illegible]  6    0    Repressed  1.75E-02  Novel
CCAGTCCAAG  6    0    Repressed  1.75E-02  Novel
54842       1    8    Induced    1.78E-02  ESTs
1526        1    8    Induced    1.78E-02  ATPase, Ca++ transporting, cardiac muscle, slow twitch 2
7911        1    8    Induced    1.78E-02  KIAA0323 protein
227400      1    8    Induced    1.78E-02  mitogen-activated protein kinase kinase kinase kinase 3
77269       1    8    Induced    1.78E-02  guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 2
7943        6    0    Repressed  1.81E-02  RPB5-mediating protein
278544      6    0    Repressed  1.81E-02  acetyl-Coenzyme A acetyltransferase 2 (acetoacetyl Coenzyme A thiolase)
23111       6    0    Repressed  1.81E-02  phenylalanine-tRNA synthetase-like
31442       6    0    Repressed  1.81E-02  RecQ protein-like 4
38628       6    0    Repressed  1.81E-02  hypothetical protein
77422       6    0    Repressed  1.81E-02  proteolipid protein 2 (colonic epithelium-enriched)
211973      6    0    Repressed  1.81E-02  homolog of Yeast RRP4 (ribosomal RNA processing 4), 3'-5'-exoribonuclease
171075      6    0    Repressed  1.81E-02  replication factor C (activator 1) 5 (36.5kD)
89781       6    0    Repressed  1.81E-02  upstream binding transcription factor, RNA polymerase I
154149      6    0    Repressed  1.81E-02  apurinic/apyrimidinic endonuclease (APEX nuclease)-like 2 protein

Table 3. Continued

UGD or Tag  CAP  DHT  Change     P value   Description
279772      8    1    Repressed  2.28E-02  brain specific protein
205091      8    1    Repressed  2.28E-02  ESTs, Weakly similar to WW domain binding protein 11 (M. musculus)
75658       8    1    Repressed  2.28E-02  phosphorylase, glycogen; brain
GGGCAGCTGT  8    1    Repressed  2.37E-02  Novel
105440      0    5    Induced    2.43E-02  hepatocyte nuclear factor 3, alpha
118244      0    5    Induced    2.43E-02  protein phosphatase 2, regulatory subunit B (B56), delta isoform
82389       0    5    Induced    2.43E-02  CGI-118 protein
31638       0    5    Induced    2.43E-02  restin (Reed-Steinberg cell-expressed intermediate filament-associated protein)
12013       0    5    Induced    2.43E-02  ATP-binding cassette, sub-family E (OABP), member 1
12797       0    5    Induced    2.43E-02  DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16
7736        0    5    Induced    2.43E-02  hypothetical protein
274479      0    5    Induced    2.43E-02  NME7
153138      0    5    Induced    2.43E-02  origin recognition complex, subunit 5 (yeast homolog)-like
16034       0    5    Induced    2.43E-02  ESTs
270072      0    5    Induced    2.43E-02  ESTs
187035      0    5    Induced    2.43E-02  ESTs
140452      0    5    Induced    2.43E-02  cargo selection protein (mannose 6 phosphate receptor binding protein)
182265      0    5    Induced    2.43E-02  keratin 19
19762       0    5    Induced    2.43E-02  ESTs, Weakly similar to unknown (D. melanogaster)
317         0    5    Induced    2.43E-02  topoisomerase (DNA)
6236        0    5    Induced    2.43E-02  ESTs
GCTGGAGCCT  5    0    Repressed  3.03E-02  Novel
CCAGTGCTCA  5    0    Repressed  3.03E-02  Novel
ACCCTACATA  5    0    Repressed  3.03E-02  Novel
GGGGAAATCT  5    0    Repressed  3.03E-02  Novel
ACTGGTACTG  5    0    Repressed  3.03E-02  Novel
GCTCCGGTGT  5    0    Repressed  3.03E-02  Novel
ACAGTGGTGA  5    0    Repressed  3.03E-02  Novel
7869        1    7    Induced    3.04E-02  lysophosphatidic acid acyltransferase-delta
12101       1    7    Induced    3.04E-02  hypothetical protein
266940      1    7    Induced    3.04E-02  t-complex-associated-testis-expressed 1-like 1
6196        1    7    Induced    3.04E-02  integrin-linked kinase
366         1    7    Induced    3.04E-02  6-pyruvoyltetrahydropterin synthase
173611      1    7    Induced    3.04E-02  NADH dehydrogenase (ubiquinone) Fe-S protein 2 (49kD) (NADH-coenzyme Q reductase)
102469      1    7    Induced    3.04E-02  putative nuclear protein
78825       1    7    Induced    3.04E-02  matrin 3
284465      1    7    Induced    3.04E-02  ESTs
30738       1    7    Induced    3.04E-02  hypothetical protein FLJ10407
78687       1    7    Induced    3.04E-02  neutral sphingomyelinase (N-SMase) activation associated factor
92381       1    7    Induced    3.04E-02  nudix (nucleoside diphosphate linked moiety X)-type motif 4
242039      5    0    Repressed  3.26E-02  EST
4766        5    0    Repressed  3.26E-02  DKFZP586O0120 protein
283109      5    0    Repressed  3.26E-02  hypothetical protein DKFZp762L1710
192853      5    0    Repressed  3.26E-02  ubiquitin-conjugating enzyme E2G 2 (homologous to yeast UBC7)
153678      5    0    Repressed  3.26E-02  reproduction 8

Table 3. Continued

UGD or Tag  CAP  DHT  Change     P value   Description
25131V      5    0    Repressed  3.26E-02  EST
279623      5    0    Repressed  3.26E-02  selenoprotein X
150319      5    0    Repressed  3.26E-02  ESTs
102456      5    0    Repressed  3.26E-02  survival of motor neuron protein interacting protein 1
284250      5    0    Repressed  3.26E-02  AD-003 protein
251871      5    0    Repressed  3.26E-02  CTP synthase
270480      7    1    Repressed  3.92E-02  ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE
26655       7    1    Repressed  3.92E-02  glucose-6-phosphatase, transport (glucose-6-phosphate) protein 1
8118        7    1    Repressed  3.92E-02  leukotriene A4 hydrolase
CTCCGCCGGC  7    1    Repressed  4.47E-02  Novel
AGGAAATGCT  7    1    Repressed  4.47E-02  Novel
GCTGACCGAGG  7    1    Repressed  4.47E-02  Novel
CCCATAGTCC  7    1    Repressed  4.47E-02  Novel

CAP: LNCaP cells without dihydrotestosterone
DHT: LNCaP cells treated with 10⁻⁹ M dihydrotestosterone for 24 h
UGD: unigene ID
Numbers in CAP and DHT columns are the number of tags observed in respective samples



3.2 Confirmation of SAGE data 

To confirm the differential expression pattern at the RNA
level, semiquantitative RT-PCR assay was performed on a
group of selected genes. One µg of total RNA was reverse
transcribed into cDNA with oligo dT. cDNA was amplified by
PCR for various cycles. PCR products were separated on
agarose gels and quantified by densitometry. PCR amplification
curves were plotted for each gene, and data within
the logarithmic phase of amplification were used for quantification
(Fig. 1A, B and C). The reproducibility of gene regulation
by androgen was confirmed in three independent
experiments by monitoring the induction of PSA in the presence
or absence of cyclohexamide (Fig. 1B). The kinetics
of gene induction or repression was determined for PSA
(Hs.171995), clusterin (Hs.75106) and SCMH1 (Hs.57475)
genes. Typical examples of kinetics are shown in Figure
1C. PSA was induced at 4-6 h, peaked between 6-20 h,
and gradually declined after 20 h post-treatment with DHT.
Clusterin was induced within 0.5-1 h and gradually declined
after 6-12 h. SCMH1 was repressed 2-4 h post-treatment.
Expression of PSA, clusterin, and SCMH1 was quantified by
real-time PCR (Fig. 1D).

3.3 Changes in protein expression induced 
by DHT 

Protein lysates of LNCaP cells cultured in the presence of 
DHT for 72 h and cells cultured in parallel in the absence 
of DHT were subjected to 2-D PAGE (Fig. 2). Following sil- 



ver staining, gels were digitized prior to computer-based 
matching and quantitative analysis, as previously des- 
cribed [17]. Of a total of 1031 protein spots matched 
between 2-D patterns of DHT treated and untreated cells, 
a set of 32 protein spots increased in intensity by at least 
two-fold in two independent experiments. Likewise, a set 
of 12 protein spots decreased in intensity by at least two-
fold in these two independent experiments. A two-fold
change in integrated intensity by silver staining represents 
on average a three-fold change in amount of protein, 
based on prior quantitative studies [17]. 
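
The selection rule described above (a spot counts as changed only if its integrated intensity moves at least two-fold in the same direction in both independent experiments) might be expressed as the sketch below. The function name and the ratios in the usage lines are hypothetical; the actual matching software used in the study is not described here.

```python
def classify_spot(ratios, threshold=2.0):
    """Classify a spot from treated/untreated intensity ratios.

    `ratios` holds one ratio per independent experiment. The spot is
    'increased' or 'decreased' only if every experiment agrees.
    """
    if not ratios:
        raise ValueError("at least one ratio is required")
    if any(r <= 0 for r in ratios):
        raise ValueError("ratios must be positive")
    if all(r >= threshold for r in ratios):
        return "increased"
    if all(r <= 1.0 / threshold for r in ratios):
        return "decreased"
    return "unchanged"


# Invented ratios for three spots measured in two experiments each.
print(classify_spot([2.5, 3.0]))   # both experiments at least two-fold up
print(classify_spot([0.4, 0.5]))   # both experiments at least two-fold down
print(classify_spot([2.5, 1.2]))   # experiments disagree
```

Requiring agreement across experiments trades sensitivity for reproducibility, which fits the small number of spots (44 of 1031) reported as changed.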

The 44 protein spots that changed in intensity as a result
of DHT treatment were excised from the gels, digested
with trypsin, and subsequently analyzed by MALDI-TOF
MS. The resulting spectra were used to identify the proteins
using the MS-Fit search program. Of the 44 spots
excised from the gels, 29 were identified without ambiguity
and consisted of 21 up-regulated and eight down-
regulated proteins (Table 4). The identified proteins represented
a heterogeneous group that included cytoskeletal
proteins (e.g. tropomyosin, α tubulin), metabolic enzymes
(e.g. adenine phosphoribosyl transferase, β 1,4 galactosyl
transferase, galactocerebrosidase), and the products of
previously described androgen responsive genes (e.g.
SRY, selenium binding protein) [18, 19]. Specific antibodies
confirmed the identification based on MS for all
proteins analyzed by Western blotting, which included α
tubulin, myosin light chain isoforms, nucleoside diphosphate
kinase A, glyceraldehyde 3-phosphate dehydrogenase
(G3PD), and tropomyosin (data not shown).

Figure 1. Kinetics and quantification of androgen-regulated [...]

Corresponding SAGE data were available for most of the
proteins affected by DHT treatment that were identified. It
was therefore of interest to determine if the changes
observed at the protein levels were matched by concordant
changes at the mRNA level. Remarkably, for most of
the proteins identified, there was no appreciable concordant
change at the RNA level (Table 4).

4 Discussion 

We have compared the gene expression profile of
LNCaP, an androgen responsive prostate cancer cell
line, with or without androgen (DHT) treatment. Approximately
350 of 16570 expressed genes detected at the
RNA level were affected by dihydrotestosterone at the
p<0.05 level. The DHT responsive genes included
known genes, ESTs, and novel genes. As expected, we
saw an induction of genes that are well known to be
regulated by androgens. For instance, we found a 1.7-
fold induction in kallikrein 3/PSA, a 7.6-fold induction in
prostatic kallikrein 2, and a 15.7-fold induction in prostatic
acid phosphatase by DHT in LNCaP cells. We
also saw a five-fold induction in NKX3.1/NKX3A and
three-fold in fatty acid synthase, two previously identified
androgen-regulated genes [20, 21]. Interestingly, β-microseminoprotein,
reported to be reduced or lost in prostate
tumor [22], was up-regulated more than 40-fold in
LNCaP cells by DHT. More significantly, our data indicate
that genes involved in a variety of tumor cell functions
such as growth, apoptosis, and metastasis, are directly
or indirectly regulated by androgens. In addition, it is
noteworthy that both β-actin and G3PD, the two most
frequently used loading controls, were up-regulated
approximately two-fold by DHT (p<0.01). Thus, these
two genes may not be appropriate controls in some
experiments.

Serial analysis of androgen-regulated gene expression
provides us with a list of candidate genes. However,
many factors such as the dose of DHT, the time of
cell exposure to DHT and other unknown experimental
variations will affect the level of gene expression. Therefore,
it is important to confirm the SAGE results with alternative
methods. We performed semiquantitative RT-PCR
(Fig. 1B) on approximately 20 genes and real-time quantitative
PCR on 10 genes in at least three independently
DHT treated LNCaP samples. We noticed experimental
variations in every gene determined. Some genes such
as PSA (Fig. 1B) have smaller variations and some have
larger variations. In addition, the fold change in
expression identified by SAGE is different for some genes
compared to that identified by real-time quantitative PCR.
This is probably due to the technical limitation of SAGE.
Serial analysis of gene expression is highly quantitative
for genes whose tags are detected at large numbers.

Figure 2. 2-D PAGE patterns of untreated LNCaP cells (A) and LNCaP cells
treated with DHT for 72 h (B). Identified proteins that were reduced in
levels following DHT treatments are shown in (A) and induced proteins are
shown in (B). Numbered spots represent either proteins that were changed
in level that have not been identified or isoforms of identified proteins.

Table 4. List of proteins that are altered by DHT (≥ 2-fold)

Numbers in protein columns are the intensity of protein spots

Numbers in mRNA columns are the number of SAGE tags detected 

ND: not detected 

↑: increase; ↓: decrease; ↔: equal
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When the tag number detected for a given gene is close 
to zero, the quantitative nature of SAGE is compromised. 
For instance, in approximately 60 000 transcripts of each 
sample, the PSA tag was detected 42 times in LNCaP and 
70 times in LNCaP treated with DHT. However, the tag for 
clusterin was detected 0 times and 6 times, and the tag 
for SCMH1 5 times and 0 times in the corresponding sam- 
ples. Therefore, the SAGE data of PSA is more reliable 
than that of clusterin and SCMH1 . Indeed, SAGE data of 
PSA is virtually identical to that of real-time quantitative 
PCR, whereas the SAGE data of clusterin and SCMH1 dif- 
fer from that of quantitative PCR (Fig. 1 D). 
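The dependence of reliability on tag number can be illustrated with a small calculation. The following is a minimal sketch, not the authors' analysis: it assumes both libraries were sampled to the same depth (approximately 60 000 transcripts each, as stated above) and applies an exact two-sided binomial test to the tag counts quoted in the text. The function names `fold_change` and `exact_two_sided_p` are hypothetical helpers introduced here for illustration.

```python
from math import comb


def fold_change(untreated: int, treated: int) -> float:
    """Ratio of tag counts; infinite when the untreated count is zero."""
    return treated / untreated if untreated else float("inf")


def exact_two_sided_p(untreated: int, treated: int) -> float:
    """Two-sided exact binomial p-value for equal expression.

    Assumption: both libraries have the same sequencing depth, so under
    the null hypothesis each observed tag is equally likely to come from
    either sample.
    """
    n = untreated + treated
    if n == 0:
        return 1.0
    probs = [comb(n, k) * 0.5 ** n for k in range(n + 1)]
    observed = probs[untreated]
    # Sum every outcome that is no more likely than the observed split.
    return min(1.0, sum(p for p in probs if p <= observed * (1 + 1e-9)))


# Tag counts quoted in the text: (untreated LNCaP, DHT-treated LNCaP).
tag_counts = {"PSA": (42, 70), "clusterin": (0, 6), "SCMH1": (5, 0)}

for gene, (untreated, treated) in tag_counts.items():
    print(gene, fold_change(untreated, treated),
          exact_two_sided_p(untreated, treated))
```

With counts near zero the fold change is either undefined or zero, and a difference of one or two tags changes the p-value substantially, whereas the PSA ratio rests on more than one hundred tags.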

PSA is probably the best-known androgen-regulated 
gene. We were surprised to see only 1.7-fold induction 
by DHT. However, kinetics experiments indicate that PSA 
is induced at 4-6 h, peaks between 6-20 h, and gradually 
declines 20 h post-treatment with DHT. Since the SAGE 
experiment was done in samples treated for 24 h with 
DHT, PSA mRNA level was likely to have declined from 
its peak level. Indeed, we could detect 5-10 fold induction 
of PSA at 6-8 h post-treatment. Clusterin was reported as 
an androgen-repressed gene in the rat prostate [23]. 
Recent evidence indicates that clusterin may not be 
directly androgen-repressed, but regulated by apoptotic 
stimuli [24]. Our results suggest that clusterin was 
induced within 0.5-1 h, gradually declined after 6-12 h, 
and after 24 h reduced to a lower level than that of 
untreated cells (Fig. 1 C). 

Another important question is how many of the andro- 
gen-regulated genes identified are directly induced or 
repressed by androgens. In order to address this ques- 
tion, detailed analyses must be done for each gene. As 
the first step, we will divide the androgen-regulated 
genes identified in this study into two groups: the 
cycloheximide-sensitive group, whose induction or reduction 
of expression by DHT is blocked by the protein synthesis 
inhibitor, and the cycloheximide-insensitive group, whose 
induction or reduction is not affected by cycloheximide. 
We consider that genes in the latter group are directly 
regulated by DHT. Preliminary results suggest that 
approximately 20% of the 20 genes studied by RT-PCR 
are cycloheximide-insensitive. Experiments are under 
way to determine the cycloheximide sensitivity of all of 
the 149 genes listed in Table 3. Further characterization 
of androgen-regulated genes may provide some clues 
on the transition from hormone sensitive to hormone 
refractory prostate cancer. 

A relatively small set of genes could be analyzed at the 
protein level, largely due to the limited sensitivity of 2-D 
PAGE. Nevertheless, a substantial number of detected 
proteins (44 of 1031 proteins analyzed, 4.3%) were 
affected by DHT treatment. Some of the proteins in this 



subset that were identified included the products of 
genes that were previously shown to be affected by 
androgens, namely selenium binding protein, brain specific 
creatine kinase, SRY protein, B23 and G3PD. Using 
a subtractive approach, the human selenium-binding 
protein gene was shown to be differentially expressed 
in LNCaP and reversibly down-regulated by exogenous 
androgen in a concentration-dependent manner, in con- 
cordance with our findings at the protein level [19]. An 
increase in levels of the brain specific creatine kinase B 
chain in response to androgen has also been described 
[25]. Likewise the SRY gene has been found to be respon- 
sive to androgen stimulation in LNCaP cells [18]. Also, a 
number of glycolytic enzymes including G3PD which 
were affected at the protein level have been found to be 
responsive to androgen stimulation [26]. Androgenic reg- 
ulation of the amount and phosphorylation of the protein 
B23, included in our list of affected proteins, has been 
previously described and found to be related to the early 
changes associated with androgen mediated growth of 
the prostate [27]. 

Corresponding SAGE data was available for most of the 
proteins affected by DHT treatment that were identified. 
Interestingly, for most of the proteins identified, there 
was no appreciable concordant change at the RNA level. 
There are several potential explanations for this lack of 
concordance. A particular gene may be represented by 
more than one protein isoform in 2-D gels. For example, 
in the identified group, three proteins (myosin light chain, 
G3PD and selenium binding protein) were represented 
by more than one isoform. Thus, a source of discordance 
between RNA and protein data may be that the protein 
change is limited to a particular isoform of a protein and 
not to overall protein products of a particular gene. 
Nevertheless, a change in a particular isoform is inform- 
ative and biologically meaningful and may not be pre- 
dictable from RNA data. A lack of concordance between 
RNA and protein data may also reflect either translational 
control, post-translational modifications, or changes in 
protein turnover due to DHT treatment. Yet another 
explanation for a lack of concordance could be a lag 
time for changes at the RNA level to be reflected in a 
protein change. It follows from the above considerations 
that monitoring gene expression at both RNA and protein 
levels may provide complementary information that could 
not be ascertained by solely measuring RNA or protein. 

We would like to thank Dr. Ken Kinzler (Johns Hopkins 
University) for providing SAGE software. This study is 
partially supported by DAMD17-98-1-8501, R01CA74927 
and a fund from Karmanos Cancer Institute. 
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Based on biomedical literature databases, we tried a first step for constructing a gene expression 
"data warehouse" specific to human colorectal cancer (CRC). Results of genome-wide tran- 
scriptomic research were available from 12 studies, using various technologies, namely, SAGE, 
cDNA and oligonucleotide arrays, and adaptor-tagged amplification. Three studies analyzed CRC 
cell lines and nine studies analyzed human samples. The total number of patients was 144. Out of 982 
up- or down-regulated genes, 863 (88%) were found to be differentially expressed in a single 
study, 88 in two studies, 22 in three studies, 7 in four studies, and only 2 genes in six studies. 
Eight large-scale proteomics studies were published in CRC, using 2-D-, SDS- or free-flow elec- 
trophoresis, involving only 11 patients. Out of 408 differentially expressed proteins, 339 (83%) 
were found to be differentially expressed only in a single study, 16 in three studies, 10 in four 
studies, 3 in five, and 1 in eight studies. Confirmation at proteome level of results obtained with 
large-scale transcriptomics studies was possible in 25%. This proportion was higher (67%) for 
reproducing proteome results using transcriptomics technologies. Obviously, reproducibility and 
overlapping between published gene expression results at proteome and transcriptome level are 
low in human CRC. Thus, the development of standardized processes for collecting samples, 
storing, retrieving, and querying gene expression data obtained with different technologies is of 
central importance in translational research. 
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1 Introduction 

Translational research in human colorectal cancer (CRC) is 
applying a large spectrum of molecular biology, cellular bi- 
ology, and advanced validation tools. In particular, genome- 
wide techniques are now applied to decipher modifications 
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in gene expression. In recent years, transcriptomics and 
proteomics tools have been broadly applied in CRC. The 
hope is that the new data obtained will now allow a classifi- 
cation of disease on molecular basis, deep insights into the 
pathophysiology of CRC, prognostic statements, and finally a 
systematic search for diagnostic and therapeutic targets. 

Because of the complexity of the biological system under 
investigation, the most significant contribution of transla- 
tional research in CRC is expected to derive not from the 



analysis of single experiments but from libraries of experiments. 
In other words, the results obtained so far by translational 
research tools in different clinical and experimental 
settings need to be collected, [...] and, if possible, 
synthesized. Thus, based on the biomedical literature databases, 
we tried a first step for constructing such a "data warehouse" 
specific to human CRC. 



Abbreviations: cDNA, complementary DNA; CRC, colorectal can- 
cer 
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The specific aims of the present study were to screen 
transcriptomics and proteomics studies published in CRC, 
and to elaborate a kind of "meta-analysis of gene expression" 
in CRC. Our hypothesis was that we would be able to deter- 
mine a common set of genes and gene products that are up- 
or down-regulated in human CRC by overlapping results 
obtained by different authors. 



2 Materials and methods 

This is a literature survey of gene expression data published 
in human CRC. A Medline search of reports published in 
English from 1990 to 2004 using the terms "colon cancer" 
and "gene expression" with the limit "human" was performed 
and yielded 1979 articles. All abstracts were reviewed and a 
related article search was performed on appropriate abstracts. 
Articles that satisfied the following predetermined criteria were 
selected by consensus of two reviewers (E.S., M.R.): sample 
origin (human and/or cell lines) and preparation detailed, 
technology for gene expression studies defined (transcriptomics 
and/or proteomics studies), and quantitative results on 
overexpression or underexpression available. Studies concerning 
single genes or arbitrarily selected genes were discarded. 
Results obtained in animal models were not considered. 

Data were entered into an Excel working sheet (Micro- 
soft, Seattle, USA). Since gene expression data were not 
always obtained with quantitative, but in most cases semi- 
quantitative gene expression analysis technologies, no 
expression values or ratios were entered into the database. 
Only genes reported as being over- or underexpressed were 
entered. No threshold was defined so that some genes 
defined as differentially expressed might have shown only 



marginal differences. Only descriptive statistics are provided, 
which were obtained with the in-built tools of the Excel soft- 
ware. 
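The tallying step described above (recording only whether a gene was reported as over- or underexpressed, then counting in how many studies it appears) can be sketched as follows. This is an illustrative sketch only: the study and gene identifiers are hypothetical placeholders, not data from the survey, and the authors used a spreadsheet rather than code.

```python
from collections import Counter


def count_studies_per_gene(reports):
    """Count, for each gene, the number of studies that report it.

    `reports` maps a study label to the set of gene identifiers that the
    study lists as differentially expressed (no expression values).
    """
    counts = Counter()
    for genes in reports.values():
        counts.update(set(genes))  # each study counts once per gene
    return counts


def frequency_table(counts):
    """Map 'number of studies' to 'number of genes reported that often'."""
    return dict(sorted(Counter(counts.values()).items()))


# Hypothetical placeholder data, for illustration only.
toy_reports = {
    "study_A": {"GENE1", "GENE2", "GENE3"},
    "study_B": {"GENE2", "GENE4"},
    "study_C": {"GENE2", "GENE3", "GENE5"},
}

per_gene = count_studies_per_gene(toy_reports)
print(frequency_table(per_gene))
```

Because no threshold or expression ratio is stored, a gene with a marginal difference in one study counts the same as a gene with a large difference, which is the limitation noted in the text.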

Various methods were used for large-scale translational 
research in CRC, including transcriptomics and proteomics 
technologies. Results of genome-wide transcriptomic re- 
search are available from 12 studies, using various technolo- 
gies, namely, SAGE (five studies), complementary DNA 
(cDNA) arrays (five studies), and oligonucleotide array (one 
study), as well as adaptor-tagged amplification (one study). 
Three studies analyzed CRC cell lines and nine other studies 
analyzed human samples. The total number of patients pro- 
filed in all studies was 144. Out of the nine studies with hu- 
man samples, only two studies with a total of only 22 patients 
were performed using purified epithelial cells (one of them 
using cDNA arrays, the other one using SAGE technology). 
An overview of these studies is provided in Table 1. 

Eight large-scale proteomics studies were published in 
CRC using various technologies, namely, 2-D PAGE (six 
studies), SDS-PAGE (one study), and free-flow electrophore- 
sis (one study). In total, all proteome studies in CRC involved 
only 11 patients. An overview of these studies is provided in 
Table 2. 



3 Results 

One thousand two-hundred and forty genes have been 
reported to be dysregulated (up- and/or down-regulated) in 
human CRC, representing about 5% of the 20000-25000 
human genes. 



* The complete dataset can be consulted: 
http://www.chirurgiebethel.de 



Table 1. Large-scale or genome-wide transcriptomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only 144 patients 

Author                       Year  Sample                                                       Preparation                Method                 Number of patients
Zhang et al. [18]            1997  Normal mucosa and primary tumors                             Whole tissue               SAGE                   2
SAGE-NET [19]                1997  Primary tumors and cell lines                                Cell lines                 SAGE                   -
Zhang et al. [18]            1997  Cell lines                                                   Cell lines                 SAGE                   -
Parle-McDermott et al. [20]  2000  Cell lines                                                   Cell lines                 SAGE                   -
Yanagawa et al. [21]         2001  Paired normal and cancer                                     Whole tissue               cDNA                   20
Takemasa et al. [22]         2001  Paired normal and primary tumors                             Whole tissue               cDNA                   20
Notterman et al. [23]        2001  Paired normal mucosa and primary tumors, or adenoma          Whole tissue               Oligonucleotide array  22
Buckhaults et al. [24]       2001  Normal mucosa, adenoma and primary tumors                    Purified epithelial cells  SAGE                   6
Birkenkamp et al. [25]       2002  Paired normal and primary tumors                             Whole tissue               cDNA                   27
Lin et al. [26]              2002  Paired normal mucosa and primary tumors, or adenoma          Purified epithelial cells  cDNA                   16
Muro et al. [27]             2003  Normal and cancer                                            Whole tissue               Adaptor-tagged         11
Williams et al. [28]         2003  Paired normal mucosa and primary tumors, normal and adenoma  Whole tissue               cDNA                   20
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Table 2. Large-scale or proteomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only 11 patients 

Author                Publication year  Sample                     Method                     Number of patients
Reymond et al. [9]    1997              Purified epithelial cells  2-D PAGE                   1
Simpson et al. [29]   2000              Cell lines                 SDS-PAGE                   -
Lawrie et al. [30]    2001              Purified epithelial cells  2-D PAGE                   4
Simpson et al. [31]   2001              Cell lines                 Free-flow electrophoresis  -
Medjahed et al. [32]  2003              In silico                  2-D PAGE                   -
Demalte et al. [33]   2003              Cell lines                 2-D PAGE                   -
Stierum et al. [34]   2003              Cell lines                 2-D PAGE                   -
Friedman et al. [35]  2004              Human whole tissue         2-D PAGE                   6

The vast majority of dysregulated genes in human CRC 
was found using transcriptomics tools: a total of 982 genes 
was found to be differentially expressed in at least one of 
these 12 transcriptomics studies. Out of these 982 genes, 863 
(88%) were found to be differentially expressed only in a 
single study, in other words these results have not been 
reproduced so far. The other findings could be reproduced in 
two or more transcriptomics studies: 88 genes were found to 
be differentially expressed in two studies, 22 in three studies, 
7 in four studies, no gene in five studies, and 2 genes in six 
studies. The most cited genes are listed in Tables 3 
(up-regulation) and 4 (down-regulation). 

A total of 408 proteins were found to be differentially 
expressed in human CRC in at least one study. Out of these 408 
molecules, 339 (83%) were mentioned in a single study, in 
other words these results could not be reproduced. Differential 
expression of the remaining 70 proteins was mentioned with the 
following frequency: 40 were mentioned in two proteomics 
studies, 16 in three studies, 10 in four studies, 3 in five studies, 
and a single protein in eight studies. The most cited proteins 
are listed in Table 5. It has to be noted that only a single study 
[1] provided differential display protein expression data 
obtained in the human patient, using whole tissue biopsy. 
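The proportions quoted above follow directly from the reported counts. The short sketch below re-derives them; it uses only numbers stated in the text, and the helper names `percent` and `reported_in_at_least` are hypothetical, introduced here for illustration.

```python
# Number of studies -> number of genes/proteins reported in that many studies,
# as stated in the text.
transcriptomics = {1: 863, 2: 88, 3: 22, 4: 7, 5: 0, 6: 2}
proteomics_repeated = {2: 40, 3: 16, 4: 10, 5: 3, 8: 1}  # two or more studies
PROTEINS_TOTAL = 408
PROTEINS_SINGLE_STUDY = 339


def percent(part: int, whole: int) -> int:
    """Percentage rounded to the nearest integer."""
    return round(100 * part / whole)


def reported_in_at_least(distribution: dict, k: int) -> int:
    """Number of entries reported in k or more studies."""
    return sum(n for studies, n in distribution.items() if studies >= k)


genes_total = sum(transcriptomics.values())
print(genes_total, percent(transcriptomics[1], genes_total))
print(percent(PROTEINS_SINGLE_STUDY, PROTEINS_TOTAL))
print(reported_in_at_least(transcriptomics, 3),
      reported_in_at_least(proteomics_repeated, 3))
```

The last line gives the sizes of the subsets reported in three or more studies at each level, which are the groups a cross-comparison between transcriptome and proteome results would start from.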

It is also difficult to reproduce transcriptomics results 
with proteomics tools. Out of 982 genes found to be differentially 
expressed in human CRC by genome-wide transcriptomics 
technologies (Table 6a), only 177 (18%) have 
been confirmed using proteome technologies. When the 
genes reported to be differentially expressed in three or 
more transcriptomics studies (n = 31) are compared with 
those reported to be differentially expressed in three or 
more proteomics studies (n = 30), only two genes (actin: 
ACTB_HUMAN and creatine kinase: KCRB_HUMAN) can 
be matched. Thus, the probability of reproducing a gene 
expression result obtained at transcriptome level is low when 
using proteomics technologies. 

In contrast, there is a better reproducibility when proteomics 
results are verified using transcriptomics tools 
(Table 6b). In fact, when the subset of 30 proteins that are 
consistently (in three studies or more) reported to be dysregulated 
in CRC is compared with transcriptomics results, 20 
genes can be matched, representing about two-thirds. Thus, 
the probability of being able to reproduce a proteomics result 
using transcriptomics tools is more than three times higher 
than the opposite way (67 vs. 18%). However, it has to be 
noted that in 16% of cases, results remain unclear or even 
contradictory. 

Both in the transcriptomics and proteomics studies, 
many genes and factors were found to be differentially regulated 
that obviously do not play a causal role in CRC 
carcinogenesis. 



4 Discussion 

This study aimed to screen transcriptomics and proteomics 
studies published in CRC, and to elaborate a kind of 
meta-analysis of gene expression in CRC in order to generate 
a data warehouse that would be useful in translational 
research. A significant number of translational research 
studies have been published in CRC, starting in 1997 with 
SAGE technology, followed by an increasing number of 
array-based transcriptomics studies since 2001, and more recently 
by several proteomics studies. This has created a significant 
amount of data which we were then able to compile. 

Our starting hypothesis was that we would be able to 
determine a common set of genes and gene products that are 
up- or down-regulated in human CRC by comparing results 
obtained by different authors. This endeavor was not very 
successful; obviously, overlapping between published gene 
expression results both at proteome and transcriptome level 
is low in human CRC. In fact, more than 80% of results 
could not be reproduced. Several explanations can be provided 
for this lack of reproducibility. 

First, the number of patients included in the studies is 
low (altogether 144 patients in transcriptomics, 11 in proteomics 
studies); some high impact publications having been 
conducted on samples of only two patients [18]. This is a 
problem because of interindividual genetic variability, even in 
human CRC [2]. Thus, it is allowed to hypothesize that some 
results attributed to gene deregulation might be caused by 
genetic diversity rather than by cancer-specific traits. 
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Table 3. Results of transcriptomics research in human CRC. Most cited up-regulated genes out of 12 genome-wide or large-scale studies 

TRANSCRIPTOMICS 



Accession 

Number Swiss-Prot Entry Name Name 


Columns (study; sample; method): 
 1. Yanagawa et al 2001 [21]; Paired normal and cancer; cDNA 
 2. Muro et al 2003 [27]; Normal and cancer; Adaptor-tagged 
 3. Buckhaults et al 2001 [24]; N.muc, adenoma and pr.tum; SAGE 
 4. Birkenkamp et al 2002 [25]; Paired n. and pr.tumors; cDNA 
 5. Takemasa et al 2001 [22]; Paired n. and pr.tumors; cDNA 
 6. Lin et al 2002 [26]; Paired n.muc and pr.tum or adenoma; cDNA 
 7. Parle-McDermott 2000 [20]; Cell lines; SAGE 
 8. Zhang et al 1997 [18]; Norm.muc and Prim.tum; SAGE 
 9. SAGE-NET et al 1997 [19]; Primary tumors and cell lines; SAGE 
10. Zhang et al 1997 [18]; Cell line; SAGE 
11. Williams et al 2003 [28]; Paired n.muc and pr.tum and adenoma; cDNA 
12. Notterman et al 2001 [23]; Paired n.muc and pr.tum or adenoma; Oligonucleotide Array 


AF041260 


075363 


IFM2.HUMAN 


Interferon-inducible protein 1-Bd 


1 




1 




1 






1 










AA232508 


Q9Y397 


SAHH.HUMAN 


S-Adenosylhomocysteine hydrolase (AHCY) 






1 


1 










1 


1 






AA007218 


Q96QY8 


BGH3.HUMAN 


Transforming growth factor, beta-induced, 66 kDa 


1 




1 


1 










1 








G01119 




TCPD_HUMAN 


Chaperonin containing TCP-1, subunit 4 (delta) 












1 . 








1 


1 




X75821 


Q04984 


CA11.HUMAN 


COL1A1 










1 


1 




1 










U22055 


Q96AG0 


TF3A_HUMAN 


General transcription factor IIIA 








1 




1 










1 




U14631 


P80365 


GRO_HUMAN 


GRO1 oncogene (melanoma growth stimulating 
activity, alpha) 








1 












1 


1 




M92439 


P42704 


MDP1.HUMAN 


MPDP4, MDP7 microsomal dipeptidase 






1 


1 








1 










AF029082 


P31947 


MPI2.HUMAN 


M-phase inducer phosphatase 2 






1 




















M86400 


P29312 


PROC.HUMAN 


Pyrroline 5-carboxylate reductase mRNA 






1 




















L76465 


P15428 


SPRC.HUMAN 


Secreted protein, acidic, cysteine-rich 
(osteonectin) 






1 




















X57352 


Q01628 


Q98AG0 


100 kDa coactivator mRNA 


























X03205 


P62888 


RL30_HUMAN 


60S ribosomal protein L30 


























M95787 


Q01995 


RL23.HUMAN 


60S ribosomal protein L23 


























Z25821 


P42126; 


HS9B_HUMAN 


90-kDa HSP 


























U98132 


Q99714 


RLA2.HUMAN 


Acidic ribosomal phosphoprotein P2 mRNA 


























AB028893 


P04643 


CA14.HUMAN 


Alpha-1 chain of collagen IV 


























X79239 


Q02546 




Apolipoprotein A-l 


























M13934 


P06366 


BMP4_HUMAN 


Bone morphogenetic protein-2B (BMP-2B) 


























M60854 


P17008 


.HUMAN 


CAS chroroosorci*! s^nrsistion ^ f, rp r !'?r? , ! ! ? 


























M13932 


P08708 


CC3LHUMAN 


COS] an;igen 














r 










X69150 


P25232 


TCPZ.HUMAN 


Chaperonin containing TCP1, subunit 6A(zeta1) 


























L06498 


P17075 


CA21.HUMAN 


Collagen alpha-2 type 1 
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TRANSCRIPTOMICS 





Columns (study; method): 
 1. Yanagawa et al 2001 [21]; cDNA 
 2. Muro et al 2003 [27]; Adaptor-tagged 
 3. Buckhaults et al 2001 [24]; SAGE 
 4. Birkenkamp et al 2002 [25]; cDNA 
 5. Takemasa et al 2001 [22]; cDNA 
 6. Lin et al 2002 [26]; cDNA 
 7. Parle-McDermott 2000 [20]; SAGE 
 8. Zhang et al 1997 [18]; SAGE 
 9. SAGE-NET et al 1997 [19]; SAGE 
10. Zhang et al 1997 [18]; SAGE 
11. Williams et al 2003 [28]; cDNA 
12. Notterman et al 2001 [23]; Oligonucleotide Array 

Accession Number 

Swiss-Prot Entry Name Name 


L04483 


P35265 


CO2_HUMAN 


Complement component C2 


1 


1 






















M64716 


P25111 


R10A_HUMAN 


Csa-19 mRNA 


1 


1 






















X77770 


P02383 


ROA1_HUMAN 


DNA binding protein UP1, liver mRNA fragment 


1 


1 






















W52460 


P14798 


TP2A_HUMAN 


DNA topoisomerase II (top2) 


























X55715 


P23396 


IF39_HUMAN 


Eukaryotic translation initiation factor (EIF3) mRNA 




~- 






















M77234 


P49241 


IF2B_HUMAN 


Eukaryotic translation initiation factor 2, subunit 2 


1 


1 






















M58458 


P12750 


GGH_HUMAN 


Gamma-glutamyl hydrolase 


























D16992 


P10660 


G3P2_HUMAN 


Glyceraldehyde 3 phosphate dehydrogenase 


























M77233 


P23821 


SYG_HUMAN 


Glycyl-tRNA synthetase 


























F16294 


P09058 


RS8.HUMAN 


H19RNA 


























X61156 


P08865 


HS9A.HUMAN 


Heat shock Protein HSP 90-Alpha 


























L36055 


Q13541 


HMGI.HUMAN 


Hmgl mRNA for high mobility group protein 1 


























AA316619 


P04645 


RS23_HUMAN 


Human homolog of yeast ribosomal protein S23 
(D14530) 


























LI 4599 


Hi 5233 


IMD2_HUMAN 


IMP (inosine monophosphate) dehydrogenase 2 


























X79234 


P39026 


IFMI.HUMAN 


Interferon induced transmembrane protein 1(9- 
27) 


























L06505 


P30050 


LDHB.HUMAN 


Lactate dehydrogenase B(LDH-B) 


























X64707 


P26373 


RS2_HUMAN 


LLRep 3 


























X56932 


P40429 


4F2_HUMAN 


Lymphocyte activation antigen 4F2 large subunit 


























L25899 


P39030 


NPL1_HUMAN 


Nucleosome assembly protein 1-like 1 


























X63527 


P14118 . 


GSHH.HUMAN 


Phospholipid hydroperoxide glutathione peroxidase 
























X89401 


P46778 


GDFF_HUMAN 


Prostate differentiation factor 


























H59771 


P23131 


RL6_HUMAN 


Ribosomal protein L 6 


























M 94314 


P38B63 


RL18_HUMAN 


Ribosomal protein L18(RPL18) mRNA 


























L19527 


P08526 


RL1X_HUMAN 


Ribosomal protein L18a mRNA, complete cds 


























U 14968 


P46776 


RL7.HUMAN 


Ribosomal protein L7 


























U14969 


P46779 


RL8.HUMAN 


Ribosomal protein L8 


























L38941 


P49207 


RS5_HUMAN 


Ribosomal protein S5 


























U12465 


P42766 


RS19.HUMAN 


S 19 ribosomal protein 


























F19234 


P18077 


MYC_HUMAN 


V-myc myelocytomatosis viral oncogene homolog (avian) 


























X66699 


P12751 


Q99497 


EST(PARK7;DJ1) 


1 


1 
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Table 4. Results of transcriptomlcs research in human CRC. Most cited down-regulated genes out of 12 genome-wide or large-scale studies 



TRANSCRIPTOMICS 



Entry Name Name 


Columns (study; sample; method): 
 1. Yanagawa et al 2001 [21]; Paired normal and cancer; cDNA 
 2. Muro et al 2003 [27]; Normal and cancer; Adaptor-tagged 
 3. Buckhaults et al 2001 [24]; N.muc, adenoma and pr.tum; SAGE 
 4. Birkenkamp et al 2002 [25]; Paired n. and pr.tumors; cDNA 
 5. Takemasa et al 2001 [22]; Paired n. and pr.tumors; cDNA 
 6. Lin et al 2002 [26]; Paired n.muc and pr.tum or adenoma; cDNA 
 7. Parle-McDermott 2000 [20]; Cell lines; SAGE 
 8. Zhang et al 1997 [18]; Norm.muc and Prim.tum; SAGE 
 9. SAGE-NET et al 1997 [19]; Primary tumors and cell lines; SAGE 
10. Zhang et al 1997 [18]; Cell line; SAGE 
11. Williams et al 2003 [28]; Paired n.muc and pr.tum and adenoma; cDNA 
12. Notterman et al 2001 [23]; Paired n.muc and pr.tum or adenoma; Oligonucleotide Array 


CAH2_HUMAN 


Carbonic anhydrase II 


1 " 




1 




1 












i 


1 


CEAI.HUMAN 


EST (BILIARY GLYCOPROTEIN) 






1 


1 


1 












i 


1 


BENE_HUMAN 


BENE 


1 








1 






1 








1 


CAH1JWMAN 


Carbonic anhydrase 1 


1 




1 


















1 


GUAU_HUMAN 


GCAP-II guanylate cyclase activator 2B; Uroguanylin; UGN 


1 






















1 


FABL.HUMAN 


L-FABP liver fatty acid-binding protein 1 


1 








1 
















ACDS_HUMAN 


Acyl-CoenzymeA dehydrogenase C-2 to C-3 short chain 






















i 


1 


ADHG_HUMAN 


ADH gamma 2 subunit (aa-1-375) 










1 














1 


"T4S1-HUMAN 


Co-029; transmembrane 4 superfamily member 3 






1 




















DRA_HUMAN 


Colon mucosa-associated (DRA) 
























1 


KCRB_HUMAN 


Creatine kinase 


1 
























GUAN.HUMAN 


GUCA1B guanylate cyclase activator 1B/guanylin 
























1 


095784 


IgG Fc-binding protein 


1 








1 
















ITMC_HUMAN 


Integral membrane protein 2C 










1 


1 














MK03_HUMAN 


MAPK3 






















i 




FXY3.HUMAN 


MAT8 protein 










1 
















ACTB_HUMAN 


mRNA fragment encoding cytoplasmic actin 


1 












1 


1 










SELP_HUMAN 


Selenoprotein P 










1 










1 






EST1.HUMAN 


Liver carboxylesterase 1 [Precursor] 


















1 


1 




1 


CFAD_HUMAN 


Adipsin, complement factor DCDA (EST) 
























1 


ATPB_HUMAN 


ATP5B ATP synthase 










1 
















CAHC.HUMAN 


CA12 






1 1 1 1 
















C-;i:ijr;i 'lop'jr d'riii je U.ne.'l suhur.i:} 
















1 








CALM_HUMAN 


Calmodulin-I (CALM1) mRNA, 3' UTR, partial sequence 
















1 






i 




CAH4_HUMAN 


Carbonic anhydrase IV 








1 
















1 
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TRANSCRIPTOMICS 



Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


Lin et al 2002 [26] 


Parle-McDermott 2000 [20] 


Zhang et al 1997 [18] 


SAGE-NET et al 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


cDNA 


Adaptor-tagged 


SAGE 


cDNA 


cDNA 


cDNA 


SAGE 


SAGE 


SAGE 


SAGE 


cDNA 


Oligonucleotide Array 


[WIT IF M11MAM 
IVI 1 1 C_n U (VIMIM 


^nhlA similar tA hJIflQAO hnitiAn 

cuimm similar lo go. ivi iu«wi_cqs i numan 


















i 
i 






i 




CES 2 carboxylesterase 2 








— j— 


1 
















CMGA HUMAN 


CgA 
























i 
■ 


CLUS.HUMAN 


clusterin 
























1 


K1CT.HUMAN 


Cytokeratin 20 


















1 






1 


KDGA_HUMAN 


Diacylglycerol kinase 
























1 


DTD_HUMAN 


DTD sulfate transporter 










1 
















PLA8_HUMAN 


EST 122594 5_ 


















1 






1 


ATPA_HUMAN 


F1-ATPase alpha subunit 










1 
















LEG3.HUMAN 


Galectin-3 (Galactose-specific lectin 3) (MAC -2 antigen) 










1 
















ABP_HUMAN 


HP-DA01 (diamine oxidase) 




















1 




1 


K1CS_HUMAN 


Keratin 19 












1 








1 






K2C8_HUMAN 


Keratin, type II cytoskeletal 8 (cytokeratin 8) (K 8) (CK 8) 














1 






1 






DHB2_HUMAN 


L11708 Estradiol-17 beta-dehydrogenase 2 












1 












1 


MT1H.HUMAN 


Metallothionein 1H 
























1 


MUC2.HUMAN 


Mucin 2, intestinal/tracheal 














1 


1 










MEPA_HUMAN 


PPH alpha gene 




















1 




1 


PA2A.HUMAN 


RASF-A PLA2 gene 












1 










1 




MT1F.HUMAN 


RNA helicase-related protein 
























1 


SBP1_HUMAN 


SBP selenium-binding protein 












1 


1 












MT1L_HUMAN 


Serine threonine kinase 39 (STE20/SPS1 homolog, yeast) 












1 












1 


MYL6_HUMAN • 


Myosin light polypeptide 6 


















1 




1 




TETN_HUMAN ; 


TNA tetranectin 












1 












1 


VIPR_HUMAN 


VIPR1 vasoactive intestinal polypeptide receptor 1 












1 












1 


PLA8.HUMAN 


Placenta-specific gene 8 protein 




















1 




1 


Q9NXM9 


Hypothetical protein FLJ20151 












1 












1 


T4S1.HUMAN 


Transmembrane 4 superfamily member 1 








1 












1 






ALU2.HUMAN 


Alu subfamily SB sequence contamination warning entry 






1 






1 














S116_HUMAN 


S100 calcium-binding protein A16 


















1 


1 






C14A_HUMAN 


Dual specificity protein phosphatase CDC14A 




















1 


1 





Second, results have been obtained using heterogeneous 
samples, in particular cell lines, whole tissue biopsies, and 
epithelial cells purified from surgical specimens. Results 
[...] allow accurate comparison between 
normal and cancer cells, and the presence/absence of 
proteins of interest has to be confirmed in biopsies. For 
example, when 2-D PAGE protein patterns of normal human 
colonic crypts were compared with the CRC cell line LIM 
1863, the protein spots from normal crypts matched only 
75-80% of the cell line spots and the relative expression 
levels of a large number of proteins differed [...]. When 
clinical biopsies are examined, results of phenotypic 
comparisons depend on the type of samples examined (e.g., 
heterogeneous whole tissue biopsies with inflammatory cells, 
Table 5. Results of proteomics research in human CRC. Most cited up-regulated (orange), down-regulated (green), or mentioned (yellow)
proteins out of 12 large-scale proteomics studies

Accession Number:
AF041260, AA232508, AA007218, X77584, X75821, U22055, U14631, M92439,
AF029082, M86400, L76465, X57352, X03205, M95787, Z25821, U96132,
AB028893, X79239, M13934, M60854, M13932, X69150, L08498, L04483,
M64716, X77770, W52460, X55715, M77234, M58458, D16992, M77233,
X61156, L36055

O75363, Q9Y397, Q96QY8, G01119, Q04984, Q96AG0, P80365, P42704,
P31947, P29312, P15428, Q01628, P10809, Q01995, P42126, Q99714,
P04643, Q02546, P06366, P17008, P08708, P25232, P17075, P35265,
P25111, P02383, P14798, P23396, P49241, P12750, P73R21, F0SJ58,
P08865, Q13541

Swiss-Prot Entry Name    Name
ACTB_HUMAN               mRNA fragment encoding cytoplasmic actin
ACTG_HUMAN               Actin, Cytoplasmatic 2 (Gamma-Actin)
GTP_HUMAN                Glutathione S-transferase M3 (brain)
THIO_HUMAN               Thioredoxin (ATL-derived factor) (ADF)
ANX3_HUMAN               Annexin III (Lipocortin III)
ANX4_HUMAN               Annexin IV (Lipocortin IV)
K1CR_HUMAN               Keratin, type I cytoskeletal 18 (cytokeratin 18) (K18) (CK18)
ATPB_HUMAN               ATP5B ATP synthase
PPIA_HUMAN               Peptidyl-Prolyl Cis-Trans Isomerase A (Cyclophilin A)
PDA3_HUMAN               Probable protein disulfide isomerase ER-60 precursor
TPIS_HUMAN               Triosephosphate isomerase (TIM)
TCTP_HUMAN               Translationally controlled tumor protein (TCTP) (p23)
CH60_HUMAN               Heat shock protein 60
VINC_HUMAN               Vinculin
EF1G_HUMAN               Elongation factor 1-gamma mRNA
TBA1_HUMAN               Tubulin alpha-1 chain, brain specific
CATD_HUMAN               Cathepsin D (EC 3.4.23.5) {GENE: CTSD} Homo sapiens
EF11_HUMAN               Elongation factor 1-alpha, mRNA
G3P2_HUMAN               Glyceraldehyde 3 phosphate dehydrogenase
GR78_HUMAN               78 kDa glucose-regulated protein precursor (GRP 78)
ENOA_HUMAN               Alpha enolase (2-phospho-D-glycerate hydrolase)
CRTC_HUMAN               Calreticulin precursor (CR55) (calregulin)
COF1_HUMAN               Cofilin, non-muscle isoform
HS7C_HUMAN               Heat shock Cognate 71 kDa protein
HBB_HUMAN                Hemoglobin beta chain {GENE: HBB} H.s
K2C8_HUMAN               Keratin, type II cytoskeletal 8 (cytokeratin 8) (K8) (CK8)
RM12_HUMAN               Mitochondrial 60S ribosomal protein L7/L12 precursor
GR75_HUMAN               Mitochondrial stress-70 protein precursor (GRP 75)
SODC_HUMAN               Superoxide Dismutase (Cu-Zn)
KCRB_HUMAN               Creatine kinase
143Z_HUMAN               14-3-3 protein zeta/delta (protein kinase C inhibitor protein-1)
RL4_HUMAN                60S ribosomal protein L4 (L1)
ANX1_HUMAN               Annexin I (lipocortin I) (calpactin II) (chromobindin 9) (p35)
ROA1_HUMAN               DNA binding protein UPI, liver mRNA fragment
FABL_HUMAN               L-FABP liver fatty acid-binding protein 1
Studies (method): Simpson et al 2000 [29] (SDS-PAGE); Simpson et al 2001 [31] (Free-flow electrophoresis);
Demalte et al 2003 [33] (2-D PAGE); Reymond et al 1999 [9] (2-D PAGE); Medjahed et al 2003 [32] (2-D PAGE);
Lawrie et al 2001 [30] (2-D PAGE); [...] (2-D PAGE); Stierum et al 2003 [34] (2-D PAGE)

Accession Number      Swiss-Prot Entry Name    Name
AA316619  P04645      PAB1_HUMAN               Poly(A) binding protein, mRNA
L14599    Q15233      RL3_HUMAN                Ribosomal protein L3
X79234    P39026      RS19_HUMAN               S19 ribosomal protein
L06505    P30050      SAHH_HUMAN               S-Adenosylhomocysteine hydrolase (AHCY)
X64707    P26373      TERA_HUMAN               Transitional endoplasmic reticulum ATPase (TER ATPase)
X56932    P40429      GDIR_HUMAN               Rho GDP-dissociation inhibitor 1
L25899    P39030      ACON_HUMAN               Aconitate hydratase, mitochondrial precursor (aconitase)
X63527    P14118      AKA1_HUMAN               Alcohol dehydrogenase (NADP+) (aldehyde reductase)
X89401    P46778      AAC4_HUMAN               Alpha-Actinin 4
H59771    P23131      NPM_HUMAN                B23 nucleophosmin
M94314    P38663      CATA_HUMAN               Catalase
L19527    P08526      CD44_HUMAN               CD44 antigen (Phagocytic glycoprotein 1)
U14968    P46776      TCPZ_HUMAN               Chaperonin containing TCP1, subunit 6A (zeta 1)
U14969    P46779      COXA_HUMAN               Cytochrome c oxidase polypeptide Va, mitochondrial (EC 1.9.3.1) {GENE: COX5A} H.s
L38941    P49207      COXB_HUMAN               Cytochrome c oxidase polypeptide Vb precursor, COX5B
U12465    P42766      EF1B_HUMAN               Elongation factor 1-beta (EF-1-beta)
F19234    P18077      ER29_HUMAN               Endoplasmatic reticulum protein ERp29 (ERp31) (ERp28)
X66699    P12751      EZRI_HUMAN               Ezrin (p81) (cytovillin) (villin-2)
D23660    P36578      ALFA_HUMAN               Fructose-Biphosphate Aldolase A (Muscle-Type Aldolase)
U14966    P46777      DHE3_HUMAN               Glutamate dehydrogenase 1 precursor (GDH)
X06705    P11518      HS9A_HUMAN               Heat shock Protein HSP 90-Alpha
D14531    P32969      ROL_HUMAN                Heterogeneous nuclear ribonucleoprotein L (HNRPN L)
U30255    P52209      K2C1_HUMAN               Keratin, type II cytoskeletal 1 (cytokeratin 1) (K1) (CK1)
M81182    P28288      PDX1_HUMAN               Peroxiredoxin 1 (EC 1.11.1.-) (Thioredoxin peroxidase 2)
U04627    P40939      O60506                   PRM RNA binding protein Gry-rbp (GRY-RBP) mRNA
X87949    P11021      PHB_HUMAN                Prohibitin
M16660    P08238      PDI_HUMAN                Protein disulfide isomerase precursor (PDI)
U79725    Q99795      SPCB_HUMAN               Spectrin beta chain, erythrocyte (Beta-I spectrin)
D10511    P24752      PDX3_HUMAN               Thioredoxin-dependent peroxide reductase, mitochondrial
[...]     P22303      TPM3_HUMAN               Tropomyosin alpha 3 chain (Tropomyosin 3)
AF102542  O95395      NDKA_HUMAN               Non-metastatic cells 1, protein (NM23A) expressed in
W45148    P24666      143S_HUMAN               14-3-3 protein sigma
M17886    P05386      APA1_HUMAN               Apolipoprotein A-I
M17887    P05387      DHSA_HUMAN               SDH2 succinate dehydrogenase flavoprotein subunit
Table 6. Reproducibility of transcriptome results using proteomics tools (a) and vice versa (b). Proteomics results have been reproduced in
67% of cases in transcriptomics studies. Results from transcriptomics studies are more difficult to reproduce using proteomics
tools (25%). Only two gene products can be retrieved in three and more proteomics and transcriptomics studies (actin and creatine
kinase)

a. Transcriptomics > Proteomics

Accession Number:
AF041260, AA232508, AA007218, X05014, X75821, U22055, U14631, M92439,
AF029082, M86400, L76465, X57352, X03205, M95787, Z25821, U96132,
AB028893, X79239, M13934, M60854, M13932, X69150, L08498, L04483,
X77770, W52460, [...], [...], M58458, D16992

O75363, Q9Y397, Q96QY8, G01119, Q04984, Q96AG0, P80365, P42704,
P31947, P29312, P04643, Q02546, P08708, O95784, P02383, P14798,
P12750, P10660

Swiss-Prot Entry Name    Name
CAH2_HUMAN               Carbonic anhydrase II
CEA1_HUMAN               EST (biliary glycoprotein)
BENE_HUMAN               BENE
CAH1_HUMAN               Carbonic anhydrase I
GUAU_HUMAN               GCAP-II guanylate cyclase activator 2B; Uroguanylin; UGN
IFM2_HUMAN               Interferon-inducible protein 1-8D
FABL_HUMAN               L-FABP liver fatty acid-binding protein 1
SAHH_HUMAN               S-Adenosylhomocysteine hydrolase (AHCY)
BGH3_HUMAN               Transforming growth factor, beta-induced, 66 kDa
ACDS_HUMAN               Acyl-Coenzyme A dehydrogenase, C-2 to C-3 short chain
                         ADH gamma 2 subunit (aa 1-375)
                         APC6 or CDC16H3 cell division cycle 16
                         Chaperonin containing TCP-1, subunit 4 (delta)
                         CO-029 transmembrane 4 superfamily member 3
                         COL1A1
                         Colon mucosa-associated (DRA)
KCRB_HUMAN               Creatine kinase
TF3A_HUMAN               General transcription factor IIIA
                         GRO1 oncogene (melanoma growth stimulating activity, alpha)
                         GUCA1B guanylate cyclase activator 1B/guanylin
                         IgG Fc-binding protein
                         Integral membrane protein 2C
                         MAPK3
                         MAT8 protein
                         Mb4/lb
MDP1_HUMAN               MKUK4, MDF7 microsomal dipeptidase
MPI2_HUMAN               M-phase inducer phosphatase 2
ACTB_HUMAN               mRNA fragment encoding cytoplasmic actin
SPRC_HUMAN               Secreted protein, acidic, cysteine-rich (osteonectin)
SELP_HUMAN               Selenoprotein P
EST1_HUMAN               EST
Table 6. Continued

b. Proteomics > Transcriptomics

Accession Number      Swiss-Prot Entry Name    Name                                        TRANSCRIPTOMICS    PROTEOMICS
W52460    P14798      ACTB_HUMAN               mRNA fragment encoding cytoplasmic actin
M17887    P05387      GTP_HUMAN                Glutathione S-transferase M3 (brain)
AA496678  P20749      ACTG_HUMAN               Actin, Cytoplasmatic 2 (Gamma-Actin)
U53445
necrosis, blood, stool, etc.), so that standardized sample
preparation procedures are critical for obtaining reproduc- 
ible results. Several sample preparation methods have been 
described, in particular fluorescence-activated cell-sorting 
(FACS) [4], laser capture microdissection (LCM) [5-8], 
immunomagnetic bead separation [9], and cellular fractionation
[10]. Unfortunately, these sample preparation procedures have
rarely been applied in CRC. In our experience, the bead-based
method has several advantages over other cell purification
procedures [11].

Third, gene expression patterns depend on the array
technology platform. In transcriptomics studies, many factors
may affect the outcome of a microarray experiment, in
particular technical, instrumental, computational, and
interpretative factors. In fact, lack of reproducibility and
accuracy is a major concern in microarray studies [12]. When
cross-platform comparison was performed, reproducibility was
insufficient: only four genes from a set of 185 common genes
selected behaved consistently on three array platforms, and
agreement of about 30% was found between two brands [13].
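
The cross-platform comparison described above can be sketched in a few lines. This is only an illustration under stated assumptions: the gene names, the "up"/"down" calls, and the helper names `consistent_genes` and `percent` are hypothetical and are not taken from the cited study [13]. The only figure reused from the text is the ratio of four consistent genes out of 185 common genes.

```python
def consistent_genes(platform_calls):
    """Return genes that show the same direction of change on every platform.

    platform_calls: list of dicts mapping gene name -> "up" or "down".
    Only genes measured on all platforms are compared.
    """
    if not platform_calls:
        return []
    # Restrict the comparison to genes present on every platform.
    common = set(platform_calls[0])
    for calls in platform_calls[1:]:
        common &= set(calls)
    # A gene is consistent when all platforms report one and the same call.
    return sorted(
        gene for gene in common
        if len({calls[gene] for calls in platform_calls}) == 1
    )


def percent(part, whole):
    """Express part/whole as a percentage."""
    return 100.0 * part / whole


# Hypothetical calls from three array platforms (illustrative only).
platform_a = {"GENE1": "up", "GENE2": "down", "GENE3": "up"}
platform_b = {"GENE1": "up", "GENE2": "up", "GENE3": "up"}
platform_c = {"GENE1": "up", "GENE2": "down", "GENE3": "down"}

agreeing = consistent_genes([platform_a, platform_b, platform_c])
print(agreeing)

# Four consistent genes out of 185 common genes, as quoted in the text.
print(round(percent(4, 185), 1))
```

With the figures quoted in the text, four of 185 genes corresponds to roughly 2% agreement across the three platforms.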

Fourth, in proteomics studies, 2-D PAGE or 2-D DIGE 
have well-known technological limitations. In CRC, even 
after epithelial cell enrichment using magnetic beads, the 
mean CV of repeated 2-D PAGE analysis with silver staining 
was found to lie between 20 and 28%. Only 47% (internal) to 
76% (intrarun) of spots could be matched within a triplicate 
experiment. Interindividual phenotypic variability was high.
Thus, even under well-defined experimental conditions, 2-D 
PAGE parallel analysis of paired CRC samples is hampered 
by a significant variability [2]. 
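
As an illustration of how a mean coefficient of variation (CV) such as the one quoted above might be computed, the sketch below uses hypothetical spot volumes from a triplicate gel experiment. The spot names, the values, and the function names are assumptions made for the example. The sample standard deviation is also an assumption, since the text does not state which estimator was used.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Return the CV (sample standard deviation / mean) as a percentage."""
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined for a zero mean")
    return 100.0 * stdev(values) / m


def mean_cv(spots):
    """Average the per-spot CV over all spots matched across replicate gels."""
    return mean([coefficient_of_variation(v) for v in spots.values()])


# Hypothetical spot volumes (arbitrary units) from three replicate gels.
replicates = {
    "spot_A": [100.0, 120.0, 80.0],
    "spot_B": [50.0, 60.0, 40.0],
}

print(mean_cv(replicates))
```

Only spots matched on every replicate gel can enter such a calculation, which is one reason why the matching rates reported above matter.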

Fifth, the methods applied for generating, formatting,
storing, retrieving, and querying data are of utmost
importance to assess methodological and biological variation in
gene expression analysis. Unfortunately, due to the small
sample size (number of patients), large number of variables
examined at once, and absence of double or triple
experiments (arrays and gels are expensive and samples are rare),
statistical analysis is often not valid. In particular, assessing
the reproducibility of a variable is necessary (e.g., using the
intraclass correlation coefficient) for comparing multiple
samples at once. The use of median values instead of mean
values has been shown to improve data correction [14]. It has
also been proposed to use housekeeping genes as endogenous
controls [15]. A dedicated society, the Microarray Gene
Expression Data (MGED) Society, has been formed to
facilitate the sharing of gene expression data generated by
functional genomics and proteomics experiments [16].
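
The intraclass correlation coefficient mentioned above can be sketched as follows. This is a minimal example that assumes the one-way random-effects form, ICC(1,1); the text does not say which ICC variant is meant, and the function name and the data are hypothetical.

```python
from statistics import mean


def icc_oneway(measurements):
    """One-way random-effects ICC(1,1).

    measurements: list of n rows (e.g. one spot or gene each), every row
    holding k repeated measurements of the same variable.
    """
    n = len(measurements)
    if n < 2:
        raise ValueError("at least two rows are required")
    k = len(measurements[0])
    if k < 2 or any(len(row) != k for row in measurements):
        raise ValueError("every row needs the same number (>= 2) of replicates")

    grand_mean = mean([x for row in measurements for x in row])
    row_means = [mean(row) for row in measurements]

    # Between-row and within-row mean squares of a one-way ANOVA.
    ms_between = k * sum((m - grand_mean) ** 2 for m in row_means) / (n - 1)
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(measurements, row_means) for x in row
    ) / (n * (k - 1))

    denominator = ms_between + (k - 1) * ms_within
    if denominator == 0:
        raise ValueError("ICC is undefined when all measurements are identical")
    return (ms_between - ms_within) / denominator


# Hypothetical duplicate measurements that agree perfectly within each row.
perfect = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]]
print(icc_oneway(perfect))
```

Values near 1 indicate that repeated measurements of the same variable agree well relative to the spread between variables. Values near zero or below indicate that replicate noise dominates.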

Finally, correlation between results of transcriptomics
and proteomics results is low. For CRC, there is no
publication comparing mRNA and protein expression for a
cohort of genes. However, extrapolation is reasonable from
another epithelial cancer (lung adenocarcinoma), where
such comparison has been performed. Only a subset of the
proteins (17%) exhibited a significant correlation with
mRNA abundance [17].

Obviously, many genes and factors found to be differentially
regulated (both in transcriptomics and proteomics
studies) do not play a causal role in CRC carcinogenesis. For
example, in the studies under investigation, at least 17
mRNAs encoding ribosomal proteins were identified to be
dysregulated using cDNA arrays and 39 ESTs using SAGE
technology. This broad dysfunction of protein synthesis, in
particular of small molecules synthesis, has been reported
not only in cancer but also in several other human diseases,
where etiologies have been linked to mutations in genes of
the translational control machinery [36].

However, some findings might be of particular interest
in human CRC. For example, a small molecule group found
to be dysregulated in human CRC is the 14-3-3 protein
family. 14-3-3 proteins are ubiquitous within all eukaryotic
cells and participate in protein kinase signaling pathways. In
particular, they are involved in phosphorylation-dependent
protein-protein interactions that control progression
through the cell cycle, initiation and maintenance of DNA
damage checkpoints, activation of MAP kinases, prevention
of apoptosis, and coordination of integrin signaling and
cytoskeletal dynamics [37]. Given the prevalence of specific
14-3-3 isoforms expression in several human epithelial
cancers [38, 39], these proteins may be involved in cancer
tumorigenesis and particular isoforms may be useful as
therapeutic targets in human CRC.

In summary, we propose a gene expression data
warehouse in human CRC that is intended to help researchers
active in the field to get an overview of the data available.
However, reproducibility of results obtained in different
studies was disappointing.

As a matter of fact, the development of some types of
unified processes for generating information, formatting,
storing, retrieving, and querying data, regardless of the
technology used to generate it, is of central importance for
building gene expression databases. Such unified processes
have been proposed (see above) but have found only limited
recognition so far. This is unfortunate because these
methodological issues must be solved before trying to integrate
experimental data from different sources into functional
proteomics studies at the bench or in silico. The present
meta-analysis of gene expression highlights the need for
including a sufficient number of patients, analyzing only
purified epithelial cells, using clinical standards (such as the
international classification of disease, histopathology, and
staging), assessing the variability of the technology applied,
considering common standards for microarray data
annotation and exchange, and finally for developing software
implementing these standards and promoting the sharing of
these high quality, well annotated data within the life
sciences community.

In the absence of such unified processes, reproducibility of
results [...]tion of results between different groups will stay
difficult, and identification of diagnostic and therapeutic
targets will remain a lottery. These problems are challenging
the whole drug discovery process from the very beginning,
namely target identification, so that target validation, assay
development, and prioritization of compounds remain a
high-risk endeavor.



We are very grateful to G. Nestler for her excellent technical 
assistance. 